Abstract. We use short divisor sums to approximate prime tuples and moments for
primes in short intervals. By connecting these results to classical moment problems we are
able to prove that, for any $\eta > 0$, a positive proportion of consecutive primes are within
$\frac{1}{4} + \eta$ times the average spacing between primes.



1. Introduction 



Finding mathematical proofs for easily observed properties of the distribution of prime
numbers is a difficult and often humbling task, at least for the authors of this paper. The
twin prime conjecture is a famous example of this, but we are concerned here with the much
more modest problem of proving that there are arbitrarily large primes that are "unusually
close" together. Statistically this means we seek consecutive primes whose distance apart
is substantially less than the average distance between consecutive primes. Letting $p_n$
denote the $n$th prime, then by the prime number theorem the average gap size $p_{n+1} - p_n$
between consecutive primes is $\log p_n$. Thus we define

(1.1) $\Delta = \liminf_{n \to \infty} \frac{p_{n+1} - p_n}{\log p_n}$,

so that $\Delta$ is the smallest number for which there will be infinitely many gaps between
consecutive primes of size less than $\Delta + \epsilon$ times the average size. It is empirically evident
that

(1.2) $\Delta = 0$,

but this has never been proved. Up to now three different unconditional methods have been
invented which provide non-trivial estimates for $\Delta$.

1. The Hardy-Littlewood and Bombieri-Davenport Method. In the mid-1920's Hardy
and Littlewood^1 used the circle method to obtain a conditional result which in 1965
Bombieri and Davenport [1] both improved and made unconditional. This approach can be
interpreted as a second moment method using a truncated divisor sum as an approximation
of $\Lambda(n)$, the von Mangoldt function (see the introduction in [7]). The method proves

(1.3) $\Delta \le \frac{1}{2}$.

^1 In the unpublished paper Partitio Numerorum VII they proved, assuming the Generalized Riemann Hypothesis,
that $\Delta \le \frac{2}{3}$. In 1940 Rankin [18] refined Hardy and Littlewood's method to show that $\Delta \le \frac{1+4\Theta}{5}$,
where $\Theta$ is the supremum of the real parts of all the zeros of all Dirichlet L-functions. In particular assuming the
Generalized Riemann Hypothesis ($\Theta = \frac{1}{2}$) this gives $\Delta \le \frac{3}{5}$.
2. The Erdős Method. By the prime number theorem we have $\Delta \le 1$. Erdős [4] in
1940 was the first to prove unconditionally that $\Delta < 1$. He used the sieve upper bound for
primes differing by an even number $k$

(1.4) $\sum_{n \le N} \Lambda(n)\Lambda(n+k) \le (B + \epsilon)\,\mathfrak{S}(k)\,N$,

where $\mathfrak{S}(k)$ is the singular series and $B$ is a constant. By this bound there cannot be too
many pairs of primes with the same difference, and therefore the distribution function for
prime gaps must spread out from the average. This method gives the result

(1.5) $\Delta \le 1 - \frac{1}{2B}$.

The value $B = 4$ of Bombieri and Davenport [1] (see also [9] or [11]) or $B = 3.5$ of
Bombieri, Friedlander, and Iwaniec [2]^2, or even slightly smaller values may be used here.
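
The spreading argument of the Erdős method can be made explicit in a simplified model. The following is only a heuristic sketch under assumptions that are not made in the text: gaps are measured in units of the average spacing, $g(u)$ is a hypothetical density for the normalized gaps, and the singular series in (1.4) is replaced by its average value 1, so that the sieve bound reads $g(u) \le B$.

```latex
% Heuristic model: g(u) is an assumed density of the normalized gaps
% u = (p_{n+1} - p_n) / \log p_n.
% Total mass and average gap size are both equal to 1:
\int_0^\infty g(u)\,du = 1, \qquad \int_0^\infty u\,g(u)\,du = 1 .
% Suppose no gap is shorter than \Delta' and the sieve bound gives g(u) \le B.
% The mean is smallest when all mass sits as far to the left as allowed,
% that is, g = B on the interval [\Delta', \Delta' + 1/B]:
1 = \int_0^\infty u\,g(u)\,du \;\ge\; \int_{\Delta'}^{\Delta' + 1/B} B\,u\,du
  = \Delta' + \frac{1}{2B},
\qquad\text{so}\qquad \Delta' \le 1 - \frac{1}{2B}.
```

In this model $B = 4$ gives $1 - \frac{1}{8} = 0.875$ and $B = 3.5$ gives $1 - \frac{1}{7} \approx 0.857$.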

3. The Maier Method. In 1988 Maier [15] found certain intervals (rather sparsely distributed)
where there are $e^{\gamma}$ times as many primes as the expected number, and therefore within
these intervals the average spacing is reduced by a factor of $e^{-\gamma}$. Hence

(1.6) $\Delta \le e^{-\gamma} = 0.56145\ldots$.

In contrast to the first two methods, this method does not produce a positive proportion of
small prime gaps.



These methods may be combined to obtain improved results. Huxley [13], [14] combined
the first two methods making use of a weighted version of the first method to find

(1.7) $\Delta \le 0.44254\ldots$ (using $B = 4$), $\Delta \le 0.43494\ldots$ (using $B = 3.5$),

and Maier combined his method with Huxley's method with $B = 4$ to obtain

(1.8) $\Delta \le (0.44254\ldots)e^{-\gamma} = 0.24846\ldots$.

This last result is the best result known up to now, and as we have seen uses all three of the
previously known methods.

For several years we have been developing tools for dealing with higher correlations of
short divisor sums which approximate primes. Our first results appeared in [7], and, with
considerable help from other mathematicians, we have greatly simplified and improved
on these results in [8]. In the former paper we had an application to small gaps between
primes based on approximating a third moment. In particular we recovered the result (1.3).
The method is based on the same approximation that underlies the method of Bombieri-Davenport,
but it detects small prime gaps in a different way. In this paper we extend that
argument to all moments and obtain the limit of this method.

Let $\pi(N)$ denote the number of primes less than or equal to $N$.

Theorem 1. Let $r$ be any positive integer. For any fixed $\lambda > (\sqrt{r} - \frac{1}{2})^2$ and $N$ sufficiently
large, we have

(1.9) $\sum_{\substack{p_n \le N \\ p_{n+r} - p_n < \lambda \log p_n}} 1 \gg_r \pi(N)$.

In particular, for any fixed $\eta > 0$ and all sufficiently large $N > N_0(\eta)$, a positive proportion
of gaps $p_{n+1} - p_n$ for $p_n \le N$ are less than $(\frac{1}{4} + \eta)\log N$, and

(1.10) $\Delta \le \frac{1}{4}$.

^2 This value only holds for $k$ not too large as a function of $N$ in (1.4), but this is acceptable for (1.5).

Our results depend on the level of distribution of primes in arithmetic progressions,
and Theorem 1 makes use of the Bombieri-Vinogradov theorem. If for primes up to $N$
the level of distribution in arithmetic progressions is assumed to be $N^{\vartheta - \epsilon}$ for any $\epsilon > 0$,
then Theorem 1 holds with $\lambda > (\sqrt{r} - \sqrt{\vartheta/2})^2$. Hence, assuming the Elliott-Halberstam
conjecture that $\vartheta = 1$ holds, we obtain the improved result that

(1.11) $\liminf_{n \to \infty} \frac{p_{n+r} - p_n}{\log p_n} \le \left(\sqrt{r} - \frac{\sqrt{2}}{2}\right)^2$,

and in particular

(1.12) $\Delta \le \left(1 - \frac{\sqrt{2}}{2}\right)^2 = 0.085786\ldots < \frac{1}{11}$.
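
The two numerical constants above come from one formula evaluated at two levels of distribution. The short Python sketch below evaluates $(\sqrt{r} - \sqrt{\vartheta/2})^2$; the function name `gap_threshold` is a hypothetical helper, and the formula is taken as an assumption from the statement above rather than derived here.

```python
from math import sqrt


def gap_threshold(r: int, theta: float) -> float:
    """Evaluate (sqrt(r) - sqrt(theta / 2))**2, the assumed threshold for lambda.

    r     -- number of prime gaps spanned (p_{n+r} - p_n)
    theta -- assumed level of distribution of primes in arithmetic progressions
    """
    return (sqrt(r) - sqrt(theta / 2)) ** 2


if __name__ == "__main__":
    # theta = 1/2 corresponds to the Bombieri-Vinogradov theorem,
    # theta = 1 to the Elliott-Halberstam conjecture.
    for theta in (0.5, 1.0):
        for r in (1, 2, 3):
            print(f"theta={theta}, r={r}: {gap_threshold(r, theta):.6f}")
```

For $r = 1$ the two cases reproduce the constants $\frac{1}{4}$ of (1.10) and $0.085786\ldots$ of (1.12).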

There are several improvements that can be made in our results. First, we can incorpo- 
rate Maier's method into our method. This is a straightforward adaptation of the argument 
Maier used to combine his method with Huxley's result, although the result is complicated 
by the need to prove our propositions in the next section when they are summed over arith- 
metic progressions. Second, and more significantly, we have found in joint work with J. 
Pintz better approximations for prime tuples than those used in this paper, and these lead 
to significantly stronger results. These results will appear in future papers. 

This paper is organized as follows. In section 2 we present our method and state the 
two main propositions needed in the proof. In section 3 we prove some lemmas which are 
used in the later sections. In sections 4 and 5 we prove the propositions. In section 6 we 
examine an optimization problem related to the Poisson distribution which is used in the 
proof of Theorem 1, and finally in section 7 we prove Theorem 1.

Notation. We will take $\epsilon$ to be any sufficiently small positive number whose value
can be changed from equation to equation, and similarly $C$, $c$, and $c'$ will denote small
fixed positive constants whose value may change from equation to equation. We will let
$A$ denote a large positive constant which may be taken as large as we wish, but is fixed
throughout the paper. For a finite set $\mathcal{A}$ we let $|\mathcal{A}|$ denote the number of elements in $\mathcal{A}$.
We will sometimes write $\mathcal{A} = \mathcal{A}_k$ if $|\mathcal{A}| = k$. For a vector $\mathbf{H}$ we denote the number of
components by $|\mathbf{H}|$. A dash in a summation sign indicates all the summation variables
are relatively prime with each other, and further any sum without a lower bound on the
summation variables will have the variables start with the value 1. Empty sums will have
the value zero, and empty products will have the value 1. We will make use of the Iverson
notation that, for a statement $P$, $[P]$ is 1 if $P$ is true and is 0 if $P$ is false.

Acknowledgment. We are indebted to Andrew Granville and Kannan Soundararajan 
who have greatly clarified and simplified our method. We have made extensive use of their 
results here. We have also benefited from ideas of Enrico Bombieri, Brian Conrey, Percy 
Deift, David Farmer, John Friedlander, Roger Heath-Brown, Hugh Montgomery, Michael 
Rubinstein, and Peter Sarnak. We have used work of Dashiell Fryer, an undergraduate 
student at San Jose State University in the MARC program, who worked on properties 
of Laguerre polynomials needed in our method. The first-named author also thanks the 
American Institute of Mathematics where much of the collaboration mentioned above took 
place. In a recent preprint [10] Ben Green and Terence Tao proved a landmark result on






D. A. GOLDSTON AND C. Y. YILDIRIM 



arithmetic progressions of primes. One tool they used was the current Proposition 1 from
an earlier (not widely distributed) preprint of this paper. They corrected an oversight in our
original proof which we have incorporated into our Lemma 2 and the proof of Proposition
1.

2. Approximating Prime Tuples 

Our approach for finding small gaps between primes is to compute approximations of
the moments for the number of primes in short intervals, and this computation uses short
divisor sums to approximate prime tuples. Given a positive integer $h$, let

(2.1) $\mathcal{H} = \{h_1, h_2, \ldots, h_k\}$, with $1 \le h_1, h_2, \ldots, h_k \le h$ distinct integers,

and let $\nu_p(\mathcal{H})$ denote the number of distinct residue classes modulo $p$ the elements of $\mathcal{H}$
occupy. We define the singular series

(2.2) $\mathfrak{S}(\mathcal{H}) = \prod_p \left(1 - \frac{1}{p}\right)^{-k}\left(1 - \frac{\nu_p(\mathcal{H})}{p}\right)$.

If $\mathfrak{S}(\mathcal{H}) \ne 0$ then $\mathcal{H}$ is called admissible. Thus $\mathcal{H}$ is admissible if and only if $\nu_p(\mathcal{H}) < p$
for all $p$.
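
To make the definitions of $\nu_p(\mathcal{H})$, admissibility, and the singular series concrete, here is a minimal Python sketch. The function names and the cutoff `prime_bound` are illustrative assumptions: the infinite product is simply truncated at a finite bound, which gives only an approximation of the singular series.

```python
def primes_up_to(n):
    """Return all primes <= n using a simple sieve of Eratosthenes."""
    if n < 2:
        return []
    sieve = [True] * (n + 1)
    sieve[0] = sieve[1] = False
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = [False] * len(sieve[p * p :: p])
    return [p for p in range(2, n + 1) if sieve[p]]


def nu_p(H, p):
    """Number of distinct residue classes modulo p occupied by the elements of H."""
    return len({h % p for h in H})


def is_admissible(H):
    """H is admissible iff nu_p(H) < p for every prime p.

    Only primes p <= |H| need checking, since nu_p(H) <= |H| < p otherwise.
    """
    return all(nu_p(H, p) < p for p in primes_up_to(len(H)))


def singular_series(H, prime_bound=10**5):
    """Truncated product over p <= prime_bound of (1 - 1/p)^(-k) (1 - nu_p(H)/p)."""
    k = len(H)
    value = 1.0
    for p in primes_up_to(prime_bound):
        value *= (1 - nu_p(H, p) / p) / (1 - 1 / p) ** k
    return value


if __name__ == "__main__":
    print(is_admissible([1, 3, 7]), is_admissible([1, 3, 5]))
    print(singular_series([1, 3]))  # approximates the twin prime singular series
```

The set $\{1, 3, 5\}$ is not admissible because it occupies all three residue classes modulo 3, and the corresponding truncated product vanishes.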

Letting $\Lambda(n)$ denote the von Mangoldt function, define

(2.3) $\Lambda(n; \mathcal{H}) = \Lambda(n+h_1)\Lambda(n+h_2)\cdots\Lambda(n+h_k)$.

The Hardy-Littlewood prime tuple conjecture [12] states that for $\mathcal{H}$ admissible,

(2.4) $\sum_{n \le N} \Lambda(n; \mathcal{H}) = N(\mathfrak{S}(\mathcal{H}) + o(1))$, as $N \to \infty$.

(This is trivially true if $\mathcal{H}$ is not admissible.) We approximate $\Lambda(n)$ as in our earlier work
by using the truncated divisor sum

(2.5) $\Lambda_R(n) = \sum_{\substack{d \mid n \\ d \le R}} \mu(d)\log\frac{R}{d}$,

and then approximate $\Lambda(n; \mathcal{H})$ by

(2.6) $\Lambda_R(n; \mathcal{H}) = \Lambda_R(n+h_1)\Lambda_R(n+h_2)\cdots\Lambda_R(n+h_k)$.

For convenience we also define $\Lambda_R(n; \mathcal{H}) = 1$ if $\mathcal{H} = \emptyset$. Our method is founded on the
following two propositions which allow us to obtain information about primes. Suppose
$\mathcal{H}_1$ and $\mathcal{H}_2$ are both sets of distinct positive integers that are $\le h$, with $|\mathcal{H}_1| = k_1$ and
$|\mathcal{H}_2| = k_2$, and let $k = k_1 + k_2$. We always assume $k \ge 1$.
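
As an illustration of the truncated divisor sum, the following minimal Python sketch evaluates $\Lambda_R(n)$ and the tuple product $\Lambda_R(n;\mathcal{H})$ directly from their definitions and compares $\Lambda_R$ with the von Mangoldt function. The helper names (`mobius`, `von_mangoldt`, `lambda_R`, `lambda_R_tuple`) are illustrative choices, and trial division is used only for simplicity.

```python
from math import log, prod


def mobius(d):
    """Moebius function mu(d), computed by trial division."""
    result = 1
    p = 2
    while p * p <= d:
        if d % p == 0:
            d //= p
            if d % p == 0:
                return 0  # d has a square factor
            result = -result
        p += 1
    if d > 1:
        result = -result
    return result


def von_mangoldt(n):
    """Lambda(n) = log p if n is a power of the prime p, and 0 otherwise."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            return log(p) if n == 1 else 0.0
        p += 1
    return log(n)  # n itself is prime


def lambda_R(n, R):
    """Truncated divisor sum: sum of mu(d) * log(R/d) over d | n with d <= R."""
    top = int(min(n, R))
    return sum(mobius(d) * log(R / d) for d in range(1, top + 1) if n % d == 0)


def lambda_R_tuple(n, H, R):
    """Product of lambda_R(n + h) over h in H; equals 1 for the empty tuple."""
    return prod(lambda_R(n + h, R) for h in H)


if __name__ == "__main__":
    R = 50
    for n in (7, 8, 12, 49, 101):
        print(n, round(lambda_R(n, R), 6), round(von_mangoldt(n), 6))
```

For $1 < n \le R$ the truncation is vacuous and the sum equals $\Lambda(n)$ exactly, since $\sum_{d \mid n}\mu(d) = 0$ and $-\sum_{d \mid n}\mu(d)\log d = \Lambda(n)$; for a prime $n > R$ only $d = 1$ contributes and the value is $\log R$.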

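Proposition 1. Let $\mathcal{H} = \mathcal{H}_1 \cup \mathcal{H}_2$, and $r = |\mathcal{H}_1 \cap \mathcal{H}_2|$. If $R = o(N^{\frac{1}{k}}(\log R)^{\frac{r}{k}})$ and
$h \le R^A$ for any large constant $A > 0$, then we have for $R, N \to \infty$,

(2.7) $\sum_{n \le N} \Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2) = N(\mathfrak{S}(\mathcal{H}) + o_k(1))(\log R)^r$.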

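Proposition 2. Let $\mathcal{H} = \mathcal{H}_1 \cup \mathcal{H}_2$, $r = |\mathcal{H}_1 \cap \mathcal{H}_2|$, and $1 \le h_0 \le h$. Let $\mathcal{H}_0 = \mathcal{H} \cup \{h_0\}$,
and $r_0 = r$ if $h_0 \notin \mathcal{H}$ and $r_0 = r + 1$ if $h_0 \in \mathcal{H}$. If $R \ll_k N^{\frac{1}{2k}}(\log N)^{-B(k)}$ for a
sufficiently large positive constant $B(k)$, and $h \le R^A$, then we have for $R, N \to \infty$,

(2.8) $\sum_{n \le N} \Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)\Lambda(n+h_0) = N(\mathfrak{S}(\mathcal{H}_0) + o_k(1))(\log R)^{r_0}$.

Assuming the Elliott-Halberstam conjecture, then equation (2.8) holds for $R \ll_k N^{\frac{1}{k} - \epsilon}$
with any $\epsilon > 0$.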

The restriction on the size of $R$ in Proposition 2 can be improved in the situation when
$h_0 \in \mathcal{H}_1 \cup \mathcal{H}_2$. If we let $k^* = k - |\mathcal{H}_1 \cap \{h_0\}| - |\mathcal{H}_2 \cap \{h_0\}|$, then we see that the
reduction of cases at the start of the proof of Proposition 2 implies that Proposition 2 holds
in the range $R \ll_k N^{\frac{1}{2k^*}}(\log N)^{-B(k)}$ except in the trivial case when $k = 2$ and $k^* = 0$
where the result holds for $R \le N$. In the case of the Elliott-Halberstam conjecture $k$ can
also be replaced by $k^*$ in the bound for $R$.

We actually prove both Propositions with the error term $o_k(1)$ replaced by a series of
lower order terms, which however are not needed in any of our applications.

If we take $\mathcal{H}_2 = \emptyset$ in Proposition 1 we have, for $R = o(N^{\frac{1}{k}})$ and $h \le R^A$ for any given
constant $A > 0$, that for $R, N \to \infty$,

(2.9) $\sum_{n \le N} \Lambda_R(n;\mathcal{H}) = N(\mathfrak{S}(\mathcal{H}) + o_k(1))$,

in agreement with the Hardy-Littlewood prime tuple conjecture (2.4).

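In applying these propositions it is critical to have some form of positivity in the argument.
For example, in the special case when $\mathcal{H}_2 = \emptyset$ Proposition 2 takes the form, for
$R \ll N^{\frac{1}{2k}}(\log N)^{-B(k)}$,

(2.10) $\sum_{n \le N} \Lambda_R(n;\mathcal{H})\Lambda(n+h_0) = \begin{cases} N(\mathfrak{S}(\mathcal{H}) + o_k(1))\log R, & \text{if } h_0 \in \mathcal{H}, \\ N(\mathfrak{S}(\mathcal{H}_0) + o_k(1)), & \text{if } h_0 \notin \mathcal{H}, \end{cases}$

which would seem to exhibit that our approximation detects primes. However since $\Lambda_R(n;\mathcal{H})$
is not non-negative, it is impossible to conclude anything about primes from (2.10) alone.

On the other hand, consider instead the special case of Proposition 2 where $\mathcal{H}_1 = \mathcal{H}_2 =
\mathcal{H}$ which gives, on taking $|\mathcal{H}| = k$, for $R \ll N^{\frac{1}{4k}}(\log N)^{-B(k)}$,

(2.11) $\sum_{n \le N} \Lambda_R(n;\mathcal{H})^2\Lambda(n+h_0) = \begin{cases} N(\mathfrak{S}(\mathcal{H}) + o_k(1))(\log R)^{k+1}, & \text{if } h_0 \in \mathcal{H}, \\ N(\mathfrak{S}(\mathcal{H}_0) + o_k(1))(\log R)^{k}, & \text{if } h_0 \notin \mathcal{H}. \end{cases}$

The restriction on the size of $R$ here makes it impossible to conclude from (2.11) that any
given tuple $\mathcal{H}$ will contain two or more primes (see [?]), but Granville and Soundararajan
found a simple argument which uses the non-negativity of $\Lambda_R(n;\mathcal{H})^2$ to prove from (2.11)
that

(2.12) $\Delta \le \frac{3}{4}$.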

To prove their result, we need a formula of Gallagher that as $h \to \infty$,^3

(2.13) $\sum_{\substack{1 \le h_1, h_2, \ldots, h_k \le h \\ \text{distinct}}} \mathfrak{S}(\mathcal{H}) = h^k + O_k(h^{k - \frac{1}{2} + \epsilon})$.

We fix $k \ge 1$; the argument works equally well for any $k$, and we can take $k = 1$ if we
wish. Suppose now that

$R = N^{\frac{1}{4k}}(\log N)^{-B(k)}$, $h \ll \log N$.

^3 Granville (unpublished) and Montgomery-Soundararajan [17] have recently proved more precise results, but
these are not needed here.
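
Gallagher's formula says that the singular series has average value 1 over tuples. For pairs ($k = 2$) this can be checked numerically. The sketch below is only an illustration under stated assumptions: the singular series is approximated by truncating its Euler product at `prime_bound`, it uses the fact that the series for a pair depends only on the difference $d = |h_1 - h_2|$, and the function names are hypothetical.

```python
def primes_up_to(n):
    """Return all primes <= n using a simple sieve of Eratosthenes."""
    if n < 2:
        return []
    sieve = [True] * (n + 1)
    sieve[0] = sieve[1] = False
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = [False] * len(sieve[p * p :: p])
    return [p for p in range(2, n + 1) if sieve[p]]


def pair_singular_series(d, primes):
    """Truncated singular series of a pair {h1, h2} with |h1 - h2| = d."""
    value = 1.0
    for p in primes:
        nu = 1 if d % p == 0 else 2  # residue classes occupied modulo p
        value *= (1 - nu / p) / (1 - 1 / p) ** 2
    return value


def gallagher_pair_sum(h, prime_bound=10_000):
    """Sum of the truncated singular series over ordered pairs of distinct
    h1, h2 in [1, h]; there are 2 * (h - d) ordered pairs with difference d."""
    primes = primes_up_to(prime_bound)
    return sum(2 * (h - d) * pair_singular_series(d, primes) for d in range(1, h))


if __name__ == "__main__":
    for h in (50, 100, 200, 400):
        print(h, gallagher_pair_sum(h) / h ** 2)
```

The printed ratios should approach 1 slowly as $h$ grows, consistent with the main term $h^k$ in (2.13) for $k = 2$.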
By differencing, equation (2.11) continues to hold when the sum on the left-hand side is
over $N < n \le 2N$, and therefore we have

$\sum_{n=N+1}^{2N}\left(\sum_{1 \le h_0 \le h}\Lambda(n+h_0)\right)\Lambda_R(n;\mathcal{H})^2 \sim kN\mathfrak{S}(\mathcal{H})(\log R)^{k+1} + N\sum_{\substack{1 \le h_0 \le h \\ h_0 \ne h_i,\ 1 \le i \le k}}\mathfrak{S}(\mathcal{H}_0)(\log R)^{k}$.

Also by Proposition 1

$\sum_{n=N+1}^{2N}\Lambda_R(n;\mathcal{H})^2 \sim N\mathfrak{S}(\mathcal{H})(\log R)^{k}$,

and therefore we have on summing over all distinct tuples $1 \le h_1, h_2, \ldots, h_k \le h$ and
applying (2.13) that, for $\rho$ a fixed number and $h, N \to \infty$,

$\sum_{n=N+1}^{2N}\left(\sum_{1 \le h_0 \le h}\Lambda(n+h_0) - \rho\log N\right)\left(\sum_{\substack{1 \le h_1, h_2, \ldots, h_k \le h \\ \text{distinct}}}\Lambda_R(n;\mathcal{H})^2\right)$
$\sim Nh^k(\log R)^k\left(k\log R + h - \rho\log N\right)$
$\sim Nh^k(\log R)^k\left(h - (\rho - \tfrac{1}{4})\log N\right)$.

Since $\Lambda_R(n) \le d(n)\log R \ll n^{\epsilon}$, we see that the contribution in the sum above from
terms where $n + h_0$ is a prime power is $\ll N^{\frac{1}{2}+\epsilon}$ which is negligible, and therefore we
may restrict the sum over $h_0$ to terms where $n + h_0$ is prime. The right-hand side above is
positive if $h > (\rho - \frac{1}{4})\log N$, which implies with this restriction on $h$ that there is a value
of $n$, $N < n \le 2N$, such that

$\sum_{\substack{1 \le h_0 \le h \\ n+h_0 \text{ prime}}}\log(n+h_0) > \rho\log N$.

If $\rho > 1$ this implies that for $N$ sufficiently large there are at least two terms in this sum,
and thus by taking $\rho \to 1^{+}$ we obtain (2.12).
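
The passage from $k\log R + h - \rho\log N$ to $h - (\rho - \frac{1}{4})\log N$ uses only the choice of $R$. As a worked check, assuming $R = N^{1/(4k)}(\log N)^{-B(k)}$ as in the argument above (the small parameter $\delta$ is introduced here only for illustration):

```latex
k\log R = k\left(\frac{1}{4k}\log N - B(k)\log\log N\right)
        = \tfrac{1}{4}\log N - k\,B(k)\log\log N \sim \tfrac{1}{4}\log N ,
% hence
k\log R + h - \rho\log N \sim h - \left(\rho - \tfrac{1}{4}\right)\log N .
% For \rho = 1 + \delta with \delta > 0 the right-hand side is positive once
h > \left(\tfrac{3}{4} + \delta\right)\log N ,
% and an interval (n, n+h] containing two primes then yields a gap smaller
% than h, which is the source of the constant 3/4 in (2.12).
```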

The proof of Theorem 1 is a refinement of the above argument, where we detect primes
by the square of the linear combination of tuple approximations

(2.14) $a_0 + \sum_{j=1}^{k} a_j\left(\sum_{\substack{1 \le h_1, h_2, \ldots, h_j \le h \\ \text{distinct}}}\Lambda_R(n;\mathcal{H}_j)\right)$.

Here the $a_j$'s are available to optimize the argument. While it is possible to use (2.14)
directly, we have chosen in the proof of Theorem 1 to first approximate moments, which 
highlights the Poisson model which the prime numbers are thought to satisfy. This method 
also has the advantage of simplifying the combinatorics that occur in the problem. The 
moment method leads to an optimization problem which is familiar in the theory of or- 
thogonal polynomials, the solution of which was provided to us by Enrico Bombieri and 
Percy Deift. The final result we obtain depends on the asymptotics of the smallest zero of a 
certain sequence of Laguerre polynomials; these results are obtained by Sturm comparison 
type theorems and have appeared in the literature; Michael Rubinstein first pointed these 
out to us. 
3. Lemmas

We recall the Riemann zeta-function defined for $\mathrm{Re}(s) > 1$ by

(3.1) $\zeta(s) = \sum_{n=1}^{\infty}\frac{1}{n^s} = \prod_p\left(1 - \frac{1}{p^s}\right)^{-1}$.

The zeta function is analytic everywhere except for a simple pole with residue 1 at $s = 1$,
and therefore

(3.2) $\zeta(s) - \frac{1}{s-1}$

is an entire function. We need to use a classical zero-free region result. By Theorem 3.11
and (3.12.8) of [20] there exists a small positive constant $C$ such that $\zeta(\sigma + it) \ne 0$ in the
region

(3.3) $\sigma \ge 1 - \frac{C}{\log(|t|+2)}$

for all $t$, and further

(3.4) $\zeta(\sigma+it) - \frac{1}{\sigma - 1 + it} \ll \log(|t|+2)$, $\quad \frac{1}{\zeta(\sigma+it)} \ll \log(|t|+2)$, $\quad \frac{\zeta'}{\zeta}(\sigma+it) + \frac{1}{\sigma - 1 + it} \ll \log(|t|+2)$,

in this region. Let $(c)$ denote the contour $s = c + it$, $-\infty < t < \infty$, and let $\mathcal{L}$ denote the
contour given by

(3.5) $s = -\frac{C}{\log(|t|+2)} + it$.
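Lemma 1. We have, for $R \ge 2$ and $c > 0$,

(3.6) $\frac{1}{2\pi i}\int_{(c)}\frac{R^s}{\zeta(1+s)\,s^2}\,ds = 1 + O(e^{-c'\sqrt{\log R}})$,

and for any fixed constant $B$

(3.7) $\int_{\mathcal{L}}(\log(|s|+2))^B\left|\frac{R^s\,ds}{s^2}\right| \ll e^{-c'\sqrt{\log R}}$,

and

(3.8) $\int_{((\log R)^{-1})}(\log(|s|+2))^B\left|\frac{R^s\,ds}{s^2}\right| \ll \log R$.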



Proof. We first prove (3.7). The integral to be bounded is, for any $w \ge 2$,

$\ll \int_0^{\infty}\frac{(\log(|t|+2))^B\,R^{-\frac{C}{\log(|t|+2)}}}{(|t|+C)^2}\,dt$
$\ll (\log w)^B e^{-\frac{C\log R}{\log w}}\int_0^{w}dt + \int_w^{\infty}\frac{(\log t)^B}{t^2}\,dt$
$\ll (w(\log w)^B)\,e^{-\frac{C\log R}{\log w}} + \frac{(\log w)^B}{w}$,

and on choosing $\log w = \frac{1}{2}\sqrt{C\log R}$ this is

$\ll e^{-c'\sqrt{\log R}}$,

which proves (3.7).

To prove (3.6), we note that by the second bound in (3.4) the integrand in (3.6) vanishes
as $|t| \to \infty$ in the region to the right of $\mathcal{L}$, and therefore we can move the contour $(c)$ to
the left to $\mathcal{L}$, pass the simple pole at $s = 0$ with residue 1, and obtain

$\frac{1}{2\pi i}\int_{(c)}\frac{R^s}{\zeta(1+s)\,s^2}\,ds = 1 + \frac{1}{2\pi i}\int_{\mathcal{L}}\frac{R^s}{\zeta(1+s)\,s^2}\,ds$.

On $\mathcal{L}$ we have from (3.4) that $\frac{1}{\zeta(1+s)} \ll \log(2+|t|)$, and therefore we may use the estimate
(3.7) to obtain (3.6). Finally, since $|R^s| = e$ on the line $\sigma = (\log R)^{-1}$, the left-hand side of (3.8) is

$\ll \int_{-\infty}^{\infty}\frac{(\log(|t|+4))^B}{(\log R)^{-2} + t^2}\,dt \ll \log R$.

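Lemma 2. Let $f_R(s_1, s_2)$ be analytic in the strip $-B < \sigma_1, \sigma_2 < b$ for some positive
constants $B$ and $b$, and suppose also that for any $\epsilon > 0$, $f_R(s_1, s_2) \ll e^{\epsilon\sqrt{\log R}}$ in this strip
as $|t_1|, |t_2| \to \infty$. For $R \ge 2$ and $0 < c_1, c_2 < c$, let

(3.9) $U(R) = \frac{1}{(2\pi i)^2}\int_{(c_2)}\int_{(c_1)} f_R(s_1,s_2)\frac{\zeta(1+s_1+s_2)}{\zeta(1+s_1)\zeta(1+s_2)}\frac{R^{s_1+s_2}}{s_1^2 s_2^2}\,ds_1\,ds_2$.

Then

(3.10) $U(R) = f_R(0,0)\log R + C_R + O(e^{-c'\sqrt{\log R}})$,

where

(3.11) $C_R = \frac{\partial f_R}{\partial s_2}(0,0) + \frac{1}{2\pi i}\int_{\mathcal{L}} f_R(s_1,-s_1)\frac{ds_1}{\zeta(1+s_1)\zeta(1-s_1)\,s_1^4}$.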
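Proof. We write $f = f_R$ and $U = U(R)$ in what follows. To evaluate $U$, we first move
$(c_1)$ to $\mathcal{L}$ passing a simple pole at $s_1 = 0$ and obtain

(3.12) $U = \frac{1}{2\pi i}\int_{(c_2)} f(0,s_2)\frac{R^{s_2}}{s_2^2}\,ds_2 + \frac{1}{(2\pi i)^2}\int_{(c_2)}\int_{\mathcal{L}} f(s_1,s_2)\frac{\zeta(1+s_1+s_2)R^{s_1+s_2}}{\zeta(1+s_1)\zeta(1+s_2)\,s_1^2 s_2^2}\,ds_1\,ds_2 = U_1 + U_2$,

where we have taken $c_2$ large enough that we did not pass the simple pole of $\zeta(1+s_1+s_2)$
at $s_1 = -s_2$ in $U_2$. To evaluate $U_1$ we move $(c_2)$ to $\mathcal{L}$ passing a double pole at $s_2 = 0$ and
obtain by (3.7) of Lemma 1 and the bound for $f$ that

(3.13) $U_1 = f(0,0)\log R + \frac{\partial f}{\partial s_2}(0,0) + \frac{1}{2\pi i}\int_{\mathcal{L}} f(0,s_2)\frac{R^{s_2}}{s_2^2}\,ds_2 = f(0,0)\log R + \frac{\partial f}{\partial s_2}(0,0) + O(e^{-c'\sqrt{\log R}})$.

For $U_2$ we move $(c_2)$ to $\mathcal{L}$, passing simple poles at $s_2 = -s_1$ and $s_2 = 0$ and obtain

$U_2 = \frac{1}{2\pi i}\int_{\mathcal{L}}\frac{f(s_1,-s_1)}{\zeta(1+s_1)\zeta(1-s_1)\,s_1^4}\,ds_1 + \frac{1}{2\pi i}\int_{\mathcal{L}} f(s_1,0)\frac{R^{s_1}}{s_1^2}\,ds_1 + \frac{1}{(2\pi i)^2}\int_{\mathcal{L}}\int_{\mathcal{L}} f(s_1,s_2)\frac{\zeta(1+s_1+s_2)R^{s_1+s_2}}{\zeta(1+s_1)\zeta(1+s_2)\,s_1^2 s_2^2}\,ds_1\,ds_2$
$= \frac{1}{2\pi i}\int_{\mathcal{L}}\frac{f(s_1,-s_1)}{\zeta(1+s_1)\zeta(1-s_1)\,s_1^4}\,ds_1 + O(e^{-c'\sqrt{\log R}})$,

where we estimated the last two integrals using (3.4), Lemma 1, and the bound for $f$ stated
in the lemma. This completes the proof.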

4. Proof of Proposition 1 

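Let

(4.1) $\kappa = |\mathcal{H}_1 \cup \mathcal{H}_2|$, $\quad r = |\mathcal{H}_1 \cap \mathcal{H}_2|$, $\quad k = |\mathcal{H}_1| + |\mathcal{H}_2| = k_1 + k_2$,

and therefore $0 \le r, k_1, k_2 \le \kappa$ and

(4.2) $\kappa = k - r$.

Next, without loss of generality, we take

(4.3)
$\mathbf{H} = (h_1, h_2, \ldots, h_k)$,
$\mathcal{H}_1 = \{h_1, h_2, \ldots, h_{k_1}\}$,
$\mathcal{H}_2 = \{h_{k_1+1}, h_{k_1+2}, \ldots, h_k\}$,
$\mathcal{H}_1 \cap \mathcal{H}_2 = \{h_1, h_2, \ldots, h_r\}$,
$h_1 = h_k,\ h_2 = h_{k-1},\ \ldots,\ h_r = h_{k-r+1}$,
$\mathcal{H} = \mathcal{H}_1 \cup \mathcal{H}_2 = \{h_1, h_2, \ldots, h_{\kappa}\}$.

Here $r = 0$ when $\mathcal{H}_1 \cap \mathcal{H}_2 = \emptyset$ and the fourth and fifth lines in (4.3) may be removed.
With this notation we have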



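(4.4)
$S_k(\mathcal{H}_1,\mathcal{H}_2) = \sum_{n \le N}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)$
$= \sum_{n \le N}\ \sum_{\substack{d_1, d_2, \ldots, d_k \le R \\ d_i \mid n+h_i,\ 1 \le i \le k}}\ \prod_{i=1}^{k}\mu(d_i)\log\frac{R}{d_i}$
$= \sum_{d_1, d_2, \ldots, d_k \le R}\left(\prod_{j=1}^{k}\mu(d_j)\log\frac{R}{d_j}\right)\sum_{\substack{n \le N \\ d_j \mid n+h_j,\ 1 \le j \le k}} 1$.

Let

(4.5) $D_k = [d_1, d_2, \ldots, d_k]$,

the least common multiple of $d_1, d_2, \ldots, d_k$. The sum over $n$ above is zero unless $(d_i, d_j) \mid h_j - h_i$,
$1 \le i < j \le k$, in which case the sum runs through a unique residue class modulo $D_k$,
and we have

(4.6) $\sum_{\substack{n \le N \\ d_j \mid n+h_j,\ 1 \le j \le k}} 1 = \frac{N}{D_k} + O(1)$.

Hence

(4.7) $S_k(\mathcal{H}_1,\mathcal{H}_2) = N\sum_{\substack{d_1, d_2, \ldots, d_k \le R \\ (d_i,d_j) \mid h_j - h_i,\ 1 \le i < j \le k}}\frac{1}{D_k}\prod_{j=1}^{k}\mu(d_j)\log\frac{R}{d_j} + O(R^k) = N\,T_k(\mathcal{H}_1,\mathcal{H}_2) + O(R^k)$.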

We next decompose the $d_j$'s into relatively prime factors. Let $\mathcal{P}(k)$ be the set of all non-empty
subsets of the set of $k$ elements $\{1, 2, \ldots, k\}$. (This is just the power set with the
empty set removed.) For $\mathcal{B} \in \mathcal{P}(k)$, we let $\mathcal{P}_{\mathcal{B}}(k)$ denote the set of all members of $\mathcal{P}(k)$
for which $\mathcal{B}$ is a subset. Thus for example if $k = 4$ then

$\mathcal{P}_{\{1,2\}}(4) = \{\{1,2\}, \{1,2,3\}, \{1,2,4\}, \{1,2,3,4\}\}$.

Since the $d_j$'s are squarefree we can decompose them into the relatively prime factors

(4.8) $d_i = \prod_{\nu \in \mathcal{P}_{\{i\}}(k)} a_{\nu}$, $\quad 1 \le i \le k$,

where $a_{\nu}$ is the product of all the primes that precisely divide all the $d_i$'s for which $i \in \nu$,
and none of the other $d_j$'s. This decomposition is unique and the $2^k - 1$ $a_{\nu}$'s are pairwise
relatively prime with each other.
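
The decomposition of squarefree $d_1, \ldots, d_k$ into pairwise coprime factors indexed by non-empty subsets of $\{1, \ldots, k\}$ can be illustrated with a short Python sketch. The names `prime_factors`, `decompose`, and `rebuild` are hypothetical helpers, and the code assumes its inputs are squarefree.

```python
from math import prod


def prime_factors(n):
    """Return the set of primes dividing n (trial division)."""
    factors = set()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors.add(p)
            n //= p
        p += 1
    if n > 1:
        factors.add(n)
    return factors


def decompose(ds):
    """Map each non-empty index set nu (1-based) to a_nu, the product of the
    primes dividing exactly those d_i with i in nu. Assumes squarefree d_i."""
    a = {}
    all_primes = set().union(*(prime_factors(d) for d in ds))
    for p in all_primes:
        nu = frozenset(i + 1 for i, d in enumerate(ds) if d % p == 0)
        a[nu] = a.get(nu, 1) * p
    return a


def rebuild(a, k):
    """Recover d_i as the product of a_nu over all index sets nu containing i."""
    return [prod(v for nu, v in a.items() if i in nu) for i in range(1, k + 1)]


if __name__ == "__main__":
    ds = [6, 10, 15]
    a = decompose(ds)
    print({tuple(sorted(nu)): v for nu, v in a.items()})
    print(rebuild(a, len(ds)))
```

For $d = (6, 10, 15)$ the only non-trivial factors are $a_{\{1,2\}} = 2$, $a_{\{1,3\}} = 3$, and $a_{\{2,3\}} = 5$; factors that equal 1 are simply omitted from the dictionary.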

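We next denote by $\mathcal{D}(\mathcal{H})$ the divisibility conditions

(4.9) $(d_i, d_j) = \prod_{\nu \in \mathcal{P}_{\{i,j\}}(k)} a_{\nu} \,\Big|\, h_j - h_i$, $\quad 1 \le i < j \le k$,

and have

(4.10) $T_k(\mathcal{H}_1,\mathcal{H}_2) = \sum_{\substack{d_1, d_2, \ldots, d_k \le R \\ \mathcal{D}(\mathcal{H})}}{}^{\!\!\prime}\ \left(\prod_{\nu \in \mathcal{P}(k)}\frac{\mu(a_{\nu})^{|\nu|}}{a_{\nu}}\right)\left(\prod_{j=1}^{k}\log\frac{R}{d_j}\right)$.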

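We now apply the formula, for $c > 0$,

(4.11) $\frac{1}{2\pi i}\int_{c - i\infty}^{c + i\infty}\frac{x^s}{s^2}\,ds = \begin{cases} 0, & \text{if } 0 < x \le 1, \\ \log x, & \text{if } x \ge 1, \end{cases}$

and have that

(4.12) $T_k(\mathcal{H}_1,\mathcal{H}_2) = \frac{1}{(2\pi i)^k}\int_{(c_k)}\cdots\int_{(c_2)}\int_{(c_1)} F(s_1, s_2, \ldots, s_k)\prod_{j=1}^{k}\frac{R^{s_j}}{s_j^2}\,ds_j$,

where

(4.13) $F(s_1, s_2, \ldots, s_k) = \sum_{\substack{1 \le d_1, d_2, \ldots, d_k < \infty \\ \mathcal{D}(\mathcal{H})}}{}^{\!\!\prime}\ \prod_{\nu \in \mathcal{P}(k)}\frac{\mu(a_{\nu})^{|\nu|}}{a_{\nu}^{1+\tau_{\nu}}}$

and

(4.14) $\tau_{\nu} = \sum_{j \in \nu} s_j$.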
We next consider the divisibility conditions $\mathcal{D}(\mathcal{H})$. The variables $a_{\nu}$ indexed by the
singleton sets $\nu = \{j\}$, $1 \le j \le k$, are not constrained by these divisibility conditions,
and therefore can contain any prime as a factor. Further, if $r \ge 1$, then $h_j - h_i = 0$
for $j = k - i + 1$ and $1 \le i \le r$. Thus these constraints drop out of $\mathcal{D}(\mathcal{H})$ and the
unconstrained variables are both the singleton sets $\nu = \{j\}$, $1 \le j \le k$, and also the
doubleton sets $\nu = \{i, k - i + 1\}$, $1 \le i \le r$. (If $r = 0$ there are none of these doubleton
sets.) The remaining $a_{\nu}$ are constrained by at least one of the divisibility relations, and
therefore must divide some $h_j - h_i$ so that they can only contain prime factors $\le h$. We
therefore see that we can write $F(s_1, \ldots, s_k)$ as the Euler product, for $\sigma_j > 0$, $1 \le j \le k$,

(4.15) $F(s_1, \ldots, s_k) = \prod_{p \le h}\left(1 - \sum_{j=1}^{k}\frac{1}{p^{1+s_j}} + f_{\mathcal{H}}(p; s_1, s_2, \ldots, s_k)\right)\prod_{p > h}\left(1 - \sum_{j=1}^{k}\frac{1}{p^{1+s_j}} + \sum_{j=1}^{r}\frac{1}{p^{1+s_j+s_{k-j+1}}}\right)$,

where

(4.16) $f_{\mathcal{H}}(p; s_1, s_2, \ldots, s_k) = \sum_{\substack{\nu \in \mathcal{P}(k),\ |\nu| \ge 2 \\ p \mid h_j - h_i \text{ for all } i, j \in \nu}}\frac{(-1)^{|\nu|}}{p^{1+\tau_{\nu}}}$.

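Factoring out the dominant zeta-factors we write

(4.17) $F(s_1, s_2, \ldots, s_k) = G_{\mathcal{H}}(s_1, s_2, \ldots, s_k)\prod_{j=1}^{r}\frac{\zeta(1+s_j+s_{k-j+1})}{\zeta(1+s_j)\zeta(1+s_{k-j+1})}\prod_{j=r+1}^{\kappa}\frac{1}{\zeta(1+s_j)}$

and proceed to analyze $G_{\mathcal{H}}$. Let

(4.18) $\Delta := \prod_{\substack{1 \le i < j \le \kappa}}|h_j - h_i| \le h^{k^2}$,

so that this product is over all the non-zero differences of $h_i$ and $h_j$ for $1 \le i < j \le k$.
(Here of course $\Delta$ is not the same function as in the first section.) From the discussion
above (4.15), $f_{\mathcal{H}} = \sum_{j=1}^{r} p^{-(1+s_j+s_{k-j+1})}$ unless $p \mid \Delta$, and therefore

(4.19) $G_{\mathcal{H}}(s_1, s_2, \ldots, s_k) = \prod_{p \mid \Delta}\left(\frac{1 - \sum_{j=1}^{k}\frac{1}{p^{1+s_j}} + f_{\mathcal{H}}(p; s_1, s_2, \ldots, s_k)}{1 - \sum_{j=1}^{k}\frac{1}{p^{1+s_j}} + \sum_{j=1}^{r}\frac{1}{p^{1+s_j+s_{k-j+1}}}}\right) h(s_1, s_2, \ldots, s_k)$,

where

(4.20) $h(s_1, s_2, \ldots, s_k) = \prod_{p}\left(1 - \sum_{j=1}^{k}\frac{1}{p^{1+s_j}} + \sum_{j=1}^{r}\frac{1}{p^{1+s_j+s_{k-j+1}}}\right)\prod_{j=1}^{k}\left(1 - \frac{1}{p^{1+s_j}}\right)^{-1}\prod_{j=1}^{r}\left(1 - \frac{1}{p^{1+s_j+s_{k-j+1}}}\right)$.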
Let

(4.21) $s^* = -\sum_{j=1}^{k}\min(\sigma_j, 0)$.
Taking aj > I < j < k, we have 

(4.22) hisuS2....Sk)<^kl[{l + 0ki^)) «fel, 

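and thus in this region we have

(4.23)
$G_{\mathcal{H}}(s_1, s_2, \ldots, s_k) \ll_k \prod_{p \mid \Delta}\left(1 + O_k\left(\frac{1}{p^{1-s^*}}\right)\right)$
$\ll_k \exp\left(a(k)\sum_{p \le k^2\log(2h)}\frac{1}{p^{1-s^*}}\right)$
$\ll_k \exp\left(a(k)(k^2\log(2h))^{s^*}\sum_{p \le k^2\log(2h)}\frac{1}{p}\right)$
$\ll_k \exp\left(b(k)(\log(2h))^{s^*}\log\log\log 2h\right)$,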

where the sum which was originally over $p \mid \Delta$ has been majorized by using (4.18) to find
the smallest set of primes that could divide $\Delta$. By this bound and (4.17) we see that if
$r \ge 1$ then $F$ has simple poles at $s_i + s_{k-i+1} = 0$, $1 \le i \le r$. By (3.4), for $|s_i| > C/2$,
(4.24) 



52, . . . , Sfc) «fe exp(&(fc)(log(2/i))^* log log log 2/i) J] log2(|t, |+2) J] ■ 



1 



Sk-i+l\ 



We are now ready to begin the evaluation of $T_k(\mathcal{H}_1,\mathcal{H}_2)$. By (4.24) we see the integrand
in (4.12) goes to zero as any one of the variables $|t_j| \to \infty$, and thus we can move any
contours we wish to the left to $\mathcal{L}$. We first move successively the contours $(c_j)$, $r + 1 \le
j \le \kappa$ to $\mathcal{L}$; by (4.17) these correspond to the cases where the integrand has only a simple
pole at $s_j = 0$. If $r = \kappa$ there are none of these terms and we skip ahead to (4.28). Thus,
moving $(c_{r+1})$ to $\mathcal{L}$ and passing a simple pole at $s_{r+1} = 0$ we obtain
(4.25) 



1 

~ (27ri)'=-i 



n 

)^r+l ('^j) 



Gh(s1, S2, . . . , Sfc) 



Sr+i=0 ^ j=r+2 



Si S 



C{1 + sj+ Sk-j+i)R 



;dSj dSk-j + l 



TT — 

We bound the second multiple integral on the right by moving all the contours $(c_j)$, $j \ne
r + 1$, to $((\log R)^{-1})$ which leaves the value of the integral unchanged. If $s_j$ and $s_{k-j+1}$ are on



SMALL GAPS BETWEEN PRIMES I 






$((\log R)^{-1})$ then

(4.26) $\frac{\zeta(1+s_j+s_{k-j+1})}{\zeta(1+s_j)\zeta(1+s_{k-j+1})} \ll (\log R)\log(2+|s_j+s_{k-j+1}|)\log(2+|s_j|)\log(2+|s_{k-j+1}|)$.

In the multiple integral s* = — (Tr+i < ^'^^ '^r+i on £, and we take a fixed C < 
. Then by (4.17), (4.24), (4.26) and Lemma 1 we have the second term in (4.25) is

(4.27) 



<fc exp(6(/c)(log(2/i))^ 



log'(|s|+2) 
log^(|s|+2) 



( log ) 

J2 



R^ds 



(ioeh) 

«fe exp (fo(fc)(log(2/i))^) (logi?) 



R'ds 



X / log2(|5|+2) 



R'ds 



K+2r-lg-c'\/Iog7? 



<fc e" 



-c'fcVlog-R. 



where we used $\log 2h \ll \log R$ for the last line.

We continue this process, moving next $(c_{r+2})$ to $\mathcal{L}$ in the first term, and estimating the
secondary term as above, and so on successively through the contours up to $(c_{\kappa})$. Hence
we conclude



(4.28) 
where 



(4.29) 



Ur 



(2^,)2.(n / 



Gi(si, S2, . . . , Sr, Sk+2, • ■ ■ Sfe) 



(Cfc-3 + 1) (Cj) 



(:{\ + sj + sk-j+i)R'^+"'-^^' 



rdsi ds 



J asfc-i+1, 



7=1 + + Sfc-i+l)Sj2(sfc_j + l)2 

and 
(4.30) 

Gi(si, S2, . . . , Sr, Sk+1, S„+2, . . . , Sfc) = Gh(si, S2, . . . , Sr, 0, 0, . . . , 0, S„+i, 5^+2, ■ • • , S/c)- 



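We will now prove that

(4.31) $U_r = G_{\mathcal{H}}(0, 0, \ldots, 0)(\log R)^r + \sum_{j=1}^{r}\mathcal{A}_j(\mathcal{H})(\log R)^{r-j} + O_k(e^{-c'\sqrt{\log R}})$,

where the $\mathcal{A}_j(\mathcal{H})$ are explicitly computable arithmetic functions which for $1 \le h \le R^A$
with any $A > 0$ satisfy the bound

(4.32) $\mathcal{A}_j(\mathcal{H}) \ll_k (\log\log 2h)^{b'(k)}$.

We will also prove at the end of this section that

(4.33) $G_{\mathcal{H}}(0, 0, \ldots, 0) = \mathfrak{S}(\mathcal{H})$.

From these results Proposition 1 follows.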
The multiple integral in $U_r$ would decouple into a product of double integrals evaluated
in Lemma 2 if $G_1$ were a constant, but since this is not the case we need to apply Lemma
2 inductively. To do this we need estimates for the partial derivatives of $G_{\mathcal{H}}$. Let $a =
(a_1, a_2, \ldots, a_k)$, and define

(4.34) $D^{a}G_{\mathcal{H}} = \frac{\partial^{a_1}}{\partial s_1^{a_1}}\frac{\partial^{a_2}}{\partial s_2^{a_2}}\cdots\frac{\partial^{a_k}}{\partial s_k^{a_k}}G_{\mathcal{H}}(s_1, \ldots, s_k)$.

We have, for $\sigma_j > -c'$, $1 \le j \le k$,

(4.35) $D^{a}G_{\mathcal{H}} \ll_k (\log\log 2h)^{b'(k)}\exp\left(b(k)(\log(2h))^{s^*}\log\log\log 2h\right)$.

To obtain these estimates, we logarithmically differentiate Gh to see 

\p<fc2 log(2/i) ^ 

The sum above is bounded by

$\ll (k^2\log(2h))^{s^*}\sum_{p \le k^2\log(2h)}\frac{\log p}{p} \ll (k^2\log(2h))^{s^*}\log(k^2\log(2h))$,

and (4.35) follows in this case by (4.23). By the product rule, further partial derivatives
will satisfy the above bound with the sum having $\log p$ replaced by a power of $\log p$, which only
changes the value of $b'(k)$ in (4.35).

We first consider the case $r = 1$ in (4.29). By Lemma 2 applied with $f_R = G_1$ we see
by (4.23) that the conditions of the lemma are satisfied and therefore

$U_1 = G_{\mathcal{H}}(0, 0, \ldots, 0)\log R + \mathcal{A}_1 + O_k(e^{-c'\sqrt{\log R}})$,

where

$\mathcal{A}_1 = \frac{\partial G_1}{\partial s_k}(0, 0, \ldots, 0) + \frac{1}{2\pi i}\int_{\mathcal{L}} G_1(s_1, -s_1)\frac{ds_1}{\zeta(1+s_1)\zeta(1-s_1)\,s_1^4}$.

It remains to prove that $\mathcal{A}_1$ satisfies (4.32). By (4.35) the first term in $\mathcal{A}_1$ satisfies this
bound. In the integral term we move the contour back to the imaginary axis with a semicircle
of radius $\delta = \frac{1}{\log(k^2\log(2h))}$ to the left of the double pole at $s_1 = 0$. Using (3.4) and
(4.23) the part of the integral over the contour on the imaginary axis is bounded by

$\ll_k (\log\log 2h)^{b(k)}\int_{\delta}^{\infty}\frac{(\log(|t|+2))^2}{t^4}\,dt \ll_k \frac{(\log\log 2h)^{b(k)}}{\delta^3} \ll_k (\log\log 2h)^{b'(k)}$,

and the integral over the contour on the semicircle is bounded by

$\ll_k \frac{(\log\log 2h)^{b(k)}}{\delta^4}\times\delta \ll_k (\log\log 2h)^{b'(k)}$,

which completes the proof for $r = 1$.
For the general case of (4.29), we move all the contours to $((\log R)^{-1})$ and apply Lemma 2
for the double integral over $s_1$ and $s_k$ to obtain



(27ri)2'-2 

^ log ) ( log ) 



logi? + G2 + 0.(e-<^^) 

si=0 



(4.36) ^ (:{l + Sj+Sk-j+i)R'^+'^-'+' ^ ^ 

= logi? + 0,(e-<^^), 

where the error term was estimated using (4.26) as in (4.27). Here



G2(S2, S3, ■ • ■ ,Sr, S^+l, Sk+2, . . . , Sfc-l) = — (0, S2, S3, ... , S^, S^+i, S„+2, . . . , Sfe_i, 0) 

OSk 

1 /" ^ / \ '^■^l 
+ 7^ / Gi(Si, S2,...,Sr,S„+i,..., Sfc-l, -Si)— — — r-^, 

2Tn J C(l + si)C(l - si)si* 

c 

and therefore $U'_{r-1}$ is of the same form as $U_{r-1}$ with $G_1$ replaced by a partial derivative
of $G_1$ or an absolutely convergent integral of $G_1$ with respect to the variable $s_1$ when
$s_k = -s_1$. As we saw in the case $r = 1$, both of these terms satisfy (4.32) and (4.35). We
now apply Lemma 2 for the double integral over $s_2$ and $s_{k-1}$, and continue this process
until all the variables are exhausted. We thus arrive at (4.31) and the bound (4.32) follows
by (4.35) and the argument used in the case $r = 1$.

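It remains to prove (4.33). By (4.19) and (4.20) we have

$G_{\mathcal{H}}(0, 0, \ldots, 0) = \prod_{p \mid \Delta}\left(1 - \frac{k}{p} + f_{\mathcal{H}}(p; 0, 0, \ldots, 0)\right)\left(1 - \frac{k-r}{p}\right)^{-1}\prod_{p}\left(1 - \frac{k-r}{p}\right)\left(1 - \frac{1}{p}\right)^{-k}\left(1 - \frac{1}{p}\right)^{r}$,

where by (4.16)

$f_{\mathcal{H}}(p; 0, 0, \ldots, 0) = \frac{1}{p}\sum_{\substack{\nu \in \mathcal{P}(k),\ |\nu| \ge 2 \\ p \mid h_j - h_i \text{ for all } i, j \in \nu}}(-1)^{|\nu|}$.

Therefore by (2.2) we need to prove

(4.37) $\sum_{\substack{\nu \in \mathcal{P}(k),\ |\nu| \ge 2 \\ p \mid h_j - h_i \text{ for all } i, j \in \nu}}(-1)^{|\nu|} = k - \nu_p(\mathcal{H})$.

If $\nu_p(\mathcal{H}) = q$, then $h_1, h_2, \ldots, h_k$ must fall into $q$ distinct residue classes, say $r_1, r_2, \ldots, r_q$
(mod $p$). Let

$\mathcal{M}_p(\ell) = \{m : h_m \equiv r_{\ell} \pmod{p}\}$, $\quad 1 \le \ell \le q$.

Thus given $p$ the sets $\mathcal{M}_p(\ell)$, $1 \le \ell \le q$, give a disjoint partition of the set $\{1, 2, \ldots, k\}$,
and therefore

(4.38) $\sum_{\ell=1}^{q}|\mathcal{M}_p(\ell)| = k$.

The conditions $p \mid h_j - h_i$ hold if and only if $h_i$ and $h_j$ are in the same residue class modulo
$p$ and thus if and only if $i$ and $j$ are in $\mathcal{M}_p(\ell)$ for some $\ell$. Hence the $\nu \in \mathcal{P}(k)$ which
will satisfy $p \mid h_j - h_i$ for all $i, j \in \nu$ are precisely the subsets of $\mathcal{M}_p(\ell)$ with at least two
elements

$\mathcal{M}_p = \bigcup_{\ell=1}^{q}\{\nu : \nu \subset \mathcal{M}_p(\ell),\ |\nu| \ge 2\}$.

We conclude, using (4.38), that

$\sum_{\substack{\nu \in \mathcal{P}(k),\ |\nu| \ge 2 \\ p \mid h_j - h_i \text{ for all } i, j \in \nu}}(-1)^{|\nu|} = \sum_{\ell=1}^{q}\ \sum_{\nu \subset \mathcal{M}_p(\ell),\ |\nu| \ge 2}(-1)^{|\nu|}$
$= \sum_{\ell=1}^{q}\ \sum_{j=2}^{|\mathcal{M}_p(\ell)|}(-1)^{j}\binom{|\mathcal{M}_p(\ell)|}{j}$
$= \sum_{\ell=1}^{q}\left(-1 + |\mathcal{M}_p(\ell)|\right)$
$= \sum_{\ell=1}^{q}|\mathcal{M}_p(\ell)| - q$
$= k - \nu_p(\mathcal{H})$.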
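
The combinatorial identity (4.37) can be checked by brute force for small examples. The Python sketch below enumerates all index sets $\nu$ with $|\nu| \ge 2$ directly; the function names are illustrative, and the list `hs` is allowed to contain repeated values, mirroring the fact that $k = k_1 + k_2$ counts the common elements of $\mathcal{H}_1$ and $\mathcal{H}_2$ twice.

```python
from itertools import combinations


def alternating_subset_sum(hs, p):
    """Sum of (-1)^|nu| over index sets nu with |nu| >= 2 such that
    p divides h_j - h_i for all i, j in nu."""
    k = len(hs)
    total = 0
    for size in range(2, k + 1):
        for nu in combinations(range(k), size):
            if all((hs[j] - hs[i]) % p == 0 for i, j in combinations(nu, 2)):
                total += (-1) ** size
    return total


def nu_p(hs, p):
    """Number of distinct residue classes modulo p occupied by the entries of hs."""
    return len({h % p for h in hs})


if __name__ == "__main__":
    hs = [1, 3, 7, 9, 1]  # the repeated 1 plays the role of a common element
    for p in (2, 3, 5, 7):
        print(p, alternating_subset_sum(hs, p), len(hs) - nu_p(hs, p))
```

Each residue class of size $m$ contributes $\sum_{j=2}^{m}(-1)^j\binom{m}{j} = m - 1$, which is why the two printed columns agree.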

5. Proof of Proposition 2 
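We first reduce the proof to the special case when $h_0 \notin \mathcal{H} = \mathcal{H}_1 \cup \mathcal{H}_2$. Let

(5.1) $S(\mathcal{H}_1,\mathcal{H}_2,h_0) = \sum_{n \le N}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)\Lambda(n+h_0)$.

Since trivially $|\Lambda_R(n)| \le d(n)\log R$, we see for $i = 1, 2$

(5.2) $\Lambda_R(n;\mathcal{H}_i) \ll (d(n)\log R)^{k_i}$,

and since $d(n) \ll n^{\epsilon}$ and in Proposition 2 $R \le N^{\frac{1}{2}}$, we have

(5.3)
$S(\mathcal{H}_1,\mathcal{H}_2,h_0) = \sum_{R < n \le N}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)\Lambda(n+h_0) + O(R^{1+\epsilon})$
$= \sum_{\substack{R < n \le N \\ n+h_0 \text{ prime}}}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)\Lambda(n+h_0) + O(N^{\frac{1}{2}+\epsilon})$,

where we have removed the prime powers in the last line. If $n + h_0$ is a prime $> R$ then
its only divisor $\le R$ is $d = 1$, and therefore

$\Lambda_R(n+h_0)\Lambda(n+h_0) = \Lambda(n+h_0)\log R$.

Thus if $h_0 \in \mathcal{H}_1 \cap \mathcal{H}_2$,

$S(\mathcal{H}_1,\mathcal{H}_2,h_0) = (\log R)^2\sum_{\substack{R < n \le N \\ n+h_0 \text{ prime}}}\Lambda_R(n;\mathcal{H}_1 - \{h_0\})\Lambda_R(n;\mathcal{H}_2 - \{h_0\})\log(n+h_0) + O(N^{\frac{1}{2}+\epsilon})$,

if $h_0 \in \mathcal{H}_1 - \mathcal{H}_2$,

$S(\mathcal{H}_1,\mathcal{H}_2,h_0) = \log R\sum_{\substack{R < n \le N \\ n+h_0 \text{ prime}}}\Lambda_R(n;\mathcal{H}_1 - \{h_0\})\Lambda_R(n;\mathcal{H}_2)\log(n+h_0) + O(N^{\frac{1}{2}+\epsilon})$,

and if $h_0 \notin \mathcal{H}_1 \cup \mathcal{H}_2$,

$S(\mathcal{H}_1,\mathcal{H}_2,h_0) = \sum_{\substack{R < n \le N \\ n+h_0 \text{ prime}}}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2)\log(n+h_0) + O(N^{\frac{1}{2}+\epsilon})$.

In these sums we may once again include the terms $\le R$ and the prime powers if we wish
with the same error term, and therefore in each situation we have reduced the proof to the
case when $h_0$ is distinct from the other $h_i$'s. Henceforth we can therefore take

(5.4) $h_0 \notin \mathcal{H}$.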
Proceeding to the proof, we have 

(5.5) s{n,,n2,ho)^ Y (n^(^')iogf) E Hn + ho). 

di,d2,...,dk<R i=l ' n<N 

di\n+hi, l<i<k 

By the Chinese Remainder Theorem the sum will run through an arithmetic progression modulo $D_k$ provided $(d_i,d_j) \mid h_j - h_i$, $1 \le i < j \le k$, and will be empty otherwise. As in (4.9) we denote these conditions by $\mathcal{D}(\mathcal{H})$. Using Iverson notation, we let

(5.6) $\psi(x;q,a) := \sum_{\substack{n\le x\\ n\equiv a\ (\mathrm{mod}\ q)}}\Lambda(n) = [(a,q)=1]\frac{x}{\phi(q)} + E(x;q,a),$

and have

(5.7) $\sum_{\substack{n\le N\\ d_i\mid n+h_i,\ 1\le i\le k}}\Lambda(n+h_0) = [\mathcal{D}(\mathcal{H})]\big(\psi(N+h_0;D_k,a) - \psi(h_0;D_k,a)\big) = [\mathcal{D}(\mathcal{H})]\,\psi(N;D_k,a) + O(h\log N),$

where $a$ is an integer satisfying the congruence relations $a \equiv h_0 - h_j \pmod{d_j}$, $1 \le j \le k$. The term $\psi(N;D_k,a)$ has a non-zero main term if $(a,D_k) = 1$, which is equivalent to

(5.8) $(d_j, h_j - h_0) = 1, \quad 1 \le j \le k,$

and, if $(a,D_k) > 1$ then $\psi(N;D_k,a) \ll (\log N)^2$; thus

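(5.9) $S(\mathcal{H}_1,\mathcal{H}_2,h_0) = N\sum_{\substack{d_1,d_2,\dots,d_k\le R\\ \mathcal{D}(\mathcal{H})\\ (d_j,h_j-h_0)=1,\ 1\le j\le k}}\frac{1}{\phi(D_k)}\Big(\prod_{i=1}^{k}\mu(d_i)\log\frac{R}{d_i}\Big)$
$\qquad + O\Big(\sum_{d_1,d_2,\dots,d_k\le R}\Big(\prod_{i=1}^{k}\mu^2(d_i)\log\frac{R}{d_i}\Big)\max_{(a,D_k)=1}|E(N;D_k,a)|\Big) + O\big(R^k h(\log N)^2\big)$
$\qquad = N\,\mathcal{T}_k(\mathcal{H}_1,\mathcal{H}_2,h_0) + O(\mathcal{E}_k) + O\big(R^k h(\log N)^2\big).$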
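The compatibility condition behind the bracket $[\mathcal{D}(\mathcal{H})]$ can be checked by brute force. The following Python sketch is illustrative only (the function names `compatible`, `residues` and `check` are not from the paper): it compares the gcd criterion $(d_i,d_j)\mid h_j-h_i$ with a direct search for the residues $n$ modulo the least common multiple that satisfy $d_i \mid n+h_i$ for all $i$.

```python
from math import gcd, lcm
from itertools import product

def compatible(ds, hs):
    """Iverson bracket [D(H)]: gcd(d_i, d_j) divides h_j - h_i for all i < j."""
    k = len(ds)
    return all((hs[j] - hs[i]) % gcd(ds[i], ds[j]) == 0
               for i in range(k) for j in range(i + 1, k))

def residues(ds, hs):
    """All n modulo lcm(ds) with d_i | n + h_i for every i (brute force)."""
    modulus = lcm(*ds)
    return [n for n in range(modulus)
            if all((n + h) % d == 0 for d, h in zip(ds, hs))]

def check(max_d=6, hs=(0, 2, 6)):
    """Either exactly one residue class modulo the lcm, or none at all."""
    for ds in product(range(1, max_d + 1), repeat=len(hs)):
        expected = 1 if compatible(ds, hs) else 0
        if len(residues(ds, hs)) != expected:
            return False
    return True
```

The search does not assume the $d_i$ are squarefree, so it tests slightly more than the sum in (5.5) needs.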
We handle the error term $\mathcal{E}_k$ with the Bombieri-Vinogradov Theorem. First,

$\mathcal{E}_k \ll (\log R)^k\sum_{d_1,d_2,\dots,d_k\le R}\mu^2(D_k)\max_{\substack{a\ (\mathrm{mod}\ D_k)\\ (a,D_k)=1}}|E(N;D_k,a)| = (\log R)^k\sum_{q\le R^k}\mu^2(q)\max_{\substack{a\ (\mathrm{mod}\ q)\\ (a,q)=1}}|E(N;q,a)|\sum_{\substack{q=D_k\\ d_1,d_2,\dots,d_k\le R}}1.$

Given $q$, the number of ways to write $q = D_k$ (i.e. write $q$ as the least common multiple of $k$ squarefree numbers) is bounded by $d(q)^k$, since each of the $k$ numbers in the least common multiple must be a divisor of $q$. Applying Cauchy's inequality, we have

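$\mathcal{E}_k \ll (\log R)^k\sum_{q\le R^k}\mu^2(q)\,d(q)^k\max_{\substack{a\ (\mathrm{mod}\ q)\\ (a,q)=1}}|E(N;q,a)| \le (\log R)^k\Big(\sum_{q\le R^k}\frac{d(q)^{2k}}{q}\Big)^{1/2}\Big(\sum_{q\le R^k} q\max_{\substack{a\ (\mathrm{mod}\ q)\\ (a,q)=1}}|E(N;q,a)|^2\Big)^{1/2}.$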


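We now use the estimate

(5.10) $\sum_{n\le x} d(n)^k \ll_k x(\log x)^{2^k-1}$

and the trivial estimate $|E(N;q,a)| \ll \frac{N\log N}{q}$ for $q \le N$ to conclude

$\mathcal{E}_k \ll_k (\log R)^{4^k+k}\sqrt{N\log N}\,\Big(\sum_{q\le R^k}\max_{\substack{a\ (\mathrm{mod}\ q)\\ (a,q)=1}}|E(N;q,a)|\Big)^{1/2}.$

By the Bombieri-Vinogradov Theorem the sum on the right is $\ll \frac{N}{(\log N)^A}$ for any $A > 0$ provided

(5.11) $R^k \le N^{1/2}(\log N)^{-B},$

where $B = B(A)$. We conclude under this condition that

(5.12) $\mathcal{E}_k \ll_{k,A} N(\log N)^{4^k+k+\frac12-\frac{A}{2}} = o_k(N)$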

if A > 2(4''' + fc + i). To complete the proof of the proposition we will now prove that, 
fori?''' <Nandh< , 

(5.13) fk{ni,n2,ho) = Tk+i{Hi U {ho},n2) + Okie-^"^^'), 
which by ( 14.281 1. ( 14.321 1. and ( I4.33> completes the proof. To prove ( I5.13> . we have 
(5.14) 



Tk+i{HiU{ho},n2) = 



do.di,--- ,dk<R j=0 ■' 

{di ,dj)\hj -hi ,0<i<j<k 



5^ (nM(rf.)iog 



di,d2,--- ,dk<R. J = l 

(di,dj)\hj-hi, l<i<j<k 



E 



n{do) R 
■log- 



do<-R. 
{dQ,dj)\hj—hQ^ I'^j'^k 



[do,Dk] do 



J2 (nMrf.)log J)Ti(HiU{/io},H2) 



,d2,--- ,dk<R j=l 
■D{H) 



dj. 
On letting $g = (d_0, D_k)$, $d_0 = gd'$, we see $[d_0, D_k] = D_k d'$ and $(d', D_k) = 1$. Thus

d'Dk gd' 



gd'<B. 

{g,dj)\hj~ho, l<j<k 
(d\Dk) = l 

ff<fl d'<i?/g 
9\Dk (d',Dfc) = l 

(g,(ij)|hj-/io, l<j<k 

For log TO ^ log i? we have (by the prime number theorem or see Lemma 2.1 of Q) 

(d,m) = l 

Applying this and dropping the redundant condition g < R since g < Jlj^iCffi'^j) — 
< R when /i < R^/^, we see 

(g,dj)|?ij-/io, l<j<k 



We now claim that 



= [(cfj, - ho) = 1, 1 < J < fc]. 



(3,dj)|/ij-/io, l<j<k 

One way to see this is through the decomposition of the $d_j$'s into relatively prime factors (4.8)
from which we see we can write g — Yluevik) ^^'^ b^\ai, with the bi, pairwise relatively 
prime with each other. Then the sum becomes 

fe 

n n E /^(^-) = [(^j' - '^o) = 1' 1 < J < fc]- 

J = l fe'P{j}(fe) b„|(cij,/t3-/io) 



We conclude 

0(A 



ri(7^i u {ho},n2) = tttttK^j'^j - /io) = i, i < j < fc] 



and on substituting this result in (I5.14t we have 

k 

di,d2,-dk<R '^^ ^' 7 = 1 ^ 



11, 02, •••a*:; 

V(k) 



Ofe((logi?)''^e-^iVi°g«A'= ^ 



d(i?fc). 



di,d2, ...cifc<i? 
The first term is $\mathcal{T}_k(\mathcal{H}_1,\mathcal{H}_2,h_0)$ and by (5.10)

di,d2,...dk<R q<R'' ^ q=Dk 

dig)"'-' 
«fc (logi?)2 . 

Thus the error term is 



which proves J5.13I I if h < R"^ 



6. Optimization of a Quadratic Form Related to the Poisson Distribution.

The content of this section and the proof given here were provided to us by E. Bombieri and P. Deift. The final tool we need for our proof of Theorem 1 is an optimization procedure related to the Poisson distribution. Let $X$ be a Poisson random variable with expected value $\lambda$, defined by the discrete p.d.f.

(6.1) $p(j) = \mathrm{Prob}(X = j) = e^{-\lambda}\frac{\lambda^j}{j!}, \quad j = 0, 1, 2, \dots.$

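We define an inner product with respect to this density function by

(6.2) $\langle f(x), g(x)\rangle = \sum_{j=0}^{\infty} f(j)g(j)p(j).$

The $k$-th moment of the Poisson distribution is defined by

(6.3) $\mu_k(\lambda) = \langle x^k, 1\rangle = \sum_{j=0}^{\infty} j^k p(j) = e^{-\lambda}\sum_{j=0}^{\infty} j^k\frac{\lambda^j}{j!}.$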

More explicitly, we have

(6.4) $\mu_k(\lambda) = \sum_{\nu=0}^{k}{k \brace \nu}\lambda^{\nu},$

where ${k \brace \nu}$ denote the Stirling numbers of the second type, defined to be the number of ways to partition a $k$-set^4 into $\nu$ non-empty subsets (not counting the order of the subsets). It is easy to see that

(6.5) ${k \brace \nu} = {k-1 \brace \nu-1} + \nu{k-1 \brace \nu},$

since the last element in our $k$-set either is put into its own singleton set or else it is put into one of the $\nu$ subsets which contain some of the earlier elements. To prove (6.4) we use the identity

(6.6) $j^k = \sum_{\nu=0}^{k}{k \brace \nu}\,\nu!\binom{j}{\nu}.$

This identity arises from counting the number of partitions of a $k$-set into less than or equal to $j$ sets, where the order of these sets is counted. On one hand there are $j$ choices for where to place each of the $k$ elements, so this number is $j^k$, while on the other hand, if $\nu$ of these $j$ sets are non-empty, then there are $\nu!{k \brace \nu}$ such partitions and $\binom{j}{\nu}$ ways to choose the $\nu$ non-empty sets. Rewriting (6.6) in the form

$\frac{j^k}{j!} = \sum_{0\le\nu\le\min(j,k)}{k \brace \nu}\frac{1}{(j-\nu)!},$

multiplying by $\lambda^j e^{-\lambda}$ and summing over $j$, we obtain by (6.3)

$\mu_k(\lambda) = e^{-\lambda}\sum_{j=0}^{\infty}\lambda^j\sum_{0\le\nu\le\min(j,k)}{k \brace \nu}\frac{1}{(j-\nu)!} = \sum_{\nu=0}^{k}{k \brace \nu}\lambda^{\nu}\,e^{-\lambda}\sum_{j=\nu}^{\infty}\frac{\lambda^{j-\nu}}{(j-\nu)!} = \sum_{\nu=0}^{k}{k \brace \nu}\lambda^{\nu}$

by interchanging the $j$ and $\nu$ summations, which proves (6.4).
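The relations (6.3) to (6.6) are easy to test numerically. The sketch below is only an illustration (the function names are ours, and the truncation length `terms` is an arbitrary assumption): it builds the Stirling numbers from the recurrence (6.5), evaluates the polynomial (6.4), compares it with a truncation of the series (6.3), and checks the counting identity (6.6).

```python
from math import comb, exp, factorial

def stirling2(k, v):
    """Stirling number of the second kind via the recurrence (6.5)."""
    if k == v:
        return 1              # includes the empty partition of the empty set
    if v == 0 or v > k:
        return 0
    return stirling2(k - 1, v - 1) + v * stirling2(k - 1, v)

def poisson_moment(k, lam):
    """mu_k(lambda) as the polynomial in (6.4)."""
    return sum(stirling2(k, v) * lam ** v for v in range(k + 1))

def poisson_moment_series(k, lam, terms=100):
    """Truncation of the defining series (6.3), in floating point."""
    total, weight = 0.0, 1.0          # weight = lam^j / j!
    for j in range(terms):
        total += j ** k * weight
        weight *= lam / (j + 1)
    return exp(-lam) * total

def counting_identity_holds(j, k):
    """Identity (6.6): j^k = sum over v of v! * S(k, v) * C(j, v)."""
    return j ** k == sum(factorial(v) * stirling2(k, v) * comb(j, v)
                         for v in range(k + 1))
```

For instance, `poisson_moment(3, 2)` evaluates $\lambda^3 + 3\lambda^2 + \lambda$ at $\lambda = 2$.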

Our method for finding small gaps between primes leads us to define a second bilinear form given by

(6.7) $\langle f(x), g(x)\rangle_{\rho} = \langle x - \rho, f(x)g(x)\rangle = \sum_{j=0}^{\infty}(j-\rho)f(j)g(j)p(j),$

where $\rho$ is a real number. (This is not an inner product because it is not necessarily non-negative.) Letting $\mathbf{a} = (a_0, a_1, a_2, \dots, a_k)$, consider

(6.8) $P_{\mathbf{a}}(x) = \sum_{i=0}^{k} a_i x^i$

^4 A $k$-set denotes a set with $k$ elements.
and the associated quadratic form

(6.9) $Q = Q_{\mathbf{a}}(\lambda,\rho) = \langle P_{\mathbf{a}}(x), P_{\mathbf{a}}(x)\rangle_{\rho} = \sum_{0\le i,j\le k} a_i a_j\langle x-\rho, x^{i+j}\rangle = \sum_{0\le i,j\le k} a_i a_j\big(\mu_{i+j+1}(\lambda) - \rho\,\mu_{i+j}(\lambda)\big) = \sum_{0\le i,j\le k} a_i a_j c_{i+j}(\lambda,\rho),$

where we define

(6.10) $c_m(\lambda,\rho) = \mu_{m+1}(\lambda) - \rho\,\mu_m(\lambda).$

The optimization problem we need to solve is to maximize $Q$ over all vectors normalized by $a_k = 1$ when $\rho > 0$ is fixed. The solution involves the (generalized) Laguerre polynomials defined for $\alpha > -1$ by

(6.11) $L_n^{(\alpha)}(x) = \sum_{\nu=0}^{n}\binom{n+\alpha}{n-\nu}\frac{(-x)^{\nu}}{\nu!}.$

The zeros of the Laguerre polynomials are real, positive, and simple (see Chapter 6 of [19]). We denote the smallest zero of $L_n^{(\alpha)}(x)$ by $x_1(n,\alpha)$. The solution of our problem is obtained in the following proposition.

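Proposition 3. For each $k \ge 1$ and $\rho > k$ fixed, we have for $0 < \lambda < x_1(k+1, \rho-k-1)$

(6.12) $\max_{\mathbf{a}} Q_{\mathbf{a}}(\lambda,\rho) = -(k+1)!\,\lambda^{k}\,\frac{L_{k+1}^{(\rho-k-1)}(\lambda)}{L_{k}^{(\rho-k)}(\lambda)}.$

Thus, for each $k \ge 1$ and $\rho > k$

(6.13) $\inf\{\lambda > 0 : Q_{\mathbf{a}}(\lambda,\rho) > 0,\ a_k = 1\} = x_1(k+1, \rho-k-1).$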

The proof of this proposition will ultimately reduce to evaluating the determinant

(6.14) $D_k = \det\begin{pmatrix} c_0 & c_1 & c_2 & \cdots & c_k\\ c_1 & c_2 & c_3 & \cdots & c_{k+1}\\ c_2 & c_3 & c_4 & \cdots & c_{k+2}\\ \vdots & \vdots & \vdots & & \vdots\\ c_k & c_{k+1} & c_{k+2} & \cdots & c_{2k}\end{pmatrix} = \det\,[c_{i+j}]_{\substack{i=0,1,2,\dots,k\\ j=0,1,2,\dots,k}}.$

The solution of the optimization problem can be obtained by choosing $\mathbf{a}$ so that $P_{\mathbf{a}}(x)$ is orthogonal to all lower degree polynomials with respect to $\langle\ ,\ \rangle_{\rho}$. Thus we consider the $k$ equations

(6.15) $\langle P_{\mathbf{a}}(x), x^i\rangle_{\rho} = 0, \quad i = 0, 1, 2, \dots, k-1,$

and prove the following lemma.

Lemma 3. If $D_{k-1} \ne 0$ for a given $\lambda$, then there is an (explicitly obtained) vector $\mathbf{a}$ with $a_k = 1$ which satisfies (6.15) and for which

(6.16) $Q = \frac{D_k}{D_{k-1}}.$
Proof. We take $a_k = 1$. Equation (6.15) is equivalent to the equations

(6.17) $\sum_{j=0}^{k} a_j c_{i+j} = 0, \quad i = 0, 1, 2, \dots, k-1.$

If $\mathbf{a}$ satisfies these equations, then with $\delta_{ij}$ denoting the Kronecker delta, we have

(6.18) $Q = \sum_{i=0}^{k} a_i\sum_{j=0}^{k} a_j c_{i+j} = \sum_{i=0}^{k} a_i\,\delta_{ik}\sum_{j=0}^{k} a_j c_{k+j} = \sum_{j=0}^{k} a_j c_{k+j}.$

On rewriting (6.17) in the form

(6.19)
$c_0 a_0 + c_1 a_1 + c_2 a_2 + \cdots + c_{k-1}a_{k-1} = -c_k$
$c_1 a_0 + c_2 a_1 + c_3 a_2 + \cdots + c_k a_{k-1} = -c_{k+1}$
$\qquad\vdots$
$c_{k-1}a_0 + c_k a_1 + c_{k+1}a_2 + \cdots + c_{2k-2}a_{k-1} = -c_{2k-1}$

we have by Cramer's rule (see [21]) that these equations have the solution

(6.20) $a_j = -\frac{D_{k-1}^{(j+1)}}{D_{k-1}}, \quad j = 0, 1, \dots, k-1,$

provided that $D_{k-1} \ne 0$, where $D_{k-1}^{(i)}$ is the determinant with the $i$-th column of $D_{k-1}$ replaced by the column $(c_k, c_{k+1}, \dots, c_{2k-1})$. Thus (6.18) gives with this choice

(6.21) $D_{k-1}Q = -\sum_{j=0}^{k-1} D_{k-1}^{(j+1)}c_{k+j} + D_{k-1}c_{2k}.$

On the other hand, if we expand $D_k$ into its cofactor expansion along the bottom row we see

$D_k = \sum_{j=0}^{k}(-1)^{k+j}D_{k+1,j+1}\,c_{k+j},$

where the minor $D_{i,j}$ is the determinant of the matrix where the $i$-th row and $j$-th column of $D_k$ is removed. From (6.14) we see

$D_{k+1,j+1} = (-1)^{k-j-1}D_{k-1}^{(j+1)}, \quad 0 \le j \le k-1, \qquad D_{k+1,k+1} = D_{k-1},$

where the factor $(-1)^{k-j-1}$ results from shifting the last column of $D_k$ by $k-j-1$ places to the left. Hence we conclude

(6.22) $Q = \frac{D_k}{D_{k-1}}.$
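Lemma 3 can be checked in exact rational arithmetic for small $k$. The following sketch is an illustration under our own naming (`hankel`, `optimal_vector`, `quadratic_form` are hypothetical helpers, not the authors' code): it solves the system (6.19) by Cramer's rule with $a_k = 1$ and compares $Q_{\mathbf{a}}$ from (6.9) with the ratio $D_k/D_{k-1}$.

```python
from fractions import Fraction

def stirling2(k, v):
    if k == v:
        return 1
    if v == 0 or v > k:
        return 0
    return stirling2(k - 1, v - 1) + v * stirling2(k - 1, v)

def mu(k, lam):
    """Poisson moment (6.4), exact for rational lambda."""
    lam = Fraction(lam)
    return sum(stirling2(k, v) * lam ** v for v in range(k + 1))

def c(m, lam, rho):
    """c_m(lambda, rho) from (6.10)."""
    return mu(m + 1, lam) - Fraction(rho) * mu(m, lam)

def det(matrix):
    """Determinant by exact Gaussian elimination over the rationals."""
    a = [row[:] for row in matrix]
    n, sign, result = len(a), 1, Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for cc in range(col, n):
                a[r][cc] -= factor * a[col][cc]
    return sign * result

def hankel(k, lam, rho):
    """D_k = det[c_{i+j}], 0 <= i, j <= k, as in (6.14)."""
    return det([[c(i + j, lam, rho) for j in range(k + 1)] for i in range(k + 1)])

def optimal_vector(k, lam, rho):
    """Solve (6.19) by Cramer's rule; requires D_{k-1} != 0."""
    base = [[c(i + j, lam, rho) for j in range(k)] for i in range(k)]
    rhs = [-c(k + i, lam, rho) for i in range(k)]
    d = det(base)
    a = []
    for j in range(k):
        replaced = [row[:j] + [rhs[i]] + row[j + 1:] for i, row in enumerate(base)]
        a.append(det(replaced) / d)
    return a + [Fraction(1)]

def quadratic_form(a, lam, rho):
    """Q_a(lambda, rho) from (6.9)."""
    n = len(a)
    return sum(a[i] * a[j] * c(i + j, lam, rho) for i in range(n) for j in range(n))
```

With, say, $k = 2$, $\lambda = 1/2$ and $\rho = 7/2$, the vector returned by `optimal_vector` satisfies the orthogonality equations (6.17) and `quadratic_form` agrees with `hankel(2, ...) / hankel(1, ...)`.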
Our next lemma evaluates $D_k$.

Lemma 4. We have

(6.23) $D_{k-1} = (-1)^k\lambda^{\frac{k(k-1)}{2}}\,1!\,2!\,3!\cdots k!\;L_k^{(\rho-k)}(\lambda).$
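Proof. We first claim that

(6.24) $D_{k-1} = (-1)^k E_k,$

where

(6.25) $E_k = \det\begin{pmatrix}\mu_0 & \mu_1 & \mu_2 & \cdots & \mu_k\\ \mu_1 & \mu_2 & \mu_3 & \cdots & \mu_{k+1}\\ \mu_2 & \mu_3 & \mu_4 & \cdots & \mu_{k+2}\\ \vdots & \vdots & \vdots & & \vdots\\ \mu_{k-1} & \mu_k & \mu_{k+1} & \cdots & \mu_{2k-1}\\ 1 & \rho & \rho^2 & \cdots & \rho^k\end{pmatrix} = \det\begin{bmatrix}\mu_{i+j}\\ \rho^{j}\end{bmatrix}_{\substack{i=0,1,2,\dots,k-1\\ j=0,1,2,\dots,k}};$

for if in $E_k$ we multiply the $\ell$-th column by $\rho$ and subtract this from the $(\ell+1)$-th column for $\ell = 1, 2, \dots, k$ (using the original columns each time) we obtain

$E_k = \det\begin{pmatrix}\mu_0 & c_0 & c_1 & \cdots & c_{k-1}\\ \mu_1 & c_1 & c_2 & \cdots & c_k\\ \mu_2 & c_2 & c_3 & \cdots & c_{k+1}\\ \vdots & \vdots & \vdots & & \vdots\\ \mu_{k-1} & c_{k-1} & c_k & \cdots & c_{2k-2}\\ 1 & 0 & 0 & \cdots & 0\end{pmatrix},$

and using the cofactor expansion along the bottom row gives $E_k = (-1)^k D_{k-1}$.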
We now introduce the differential operators 



(6.26) 

Clearly we have the relations 



dX dX dX 



fc-i fc-i 

i=i 3=0 



(6.27) 



where aj{X) and bj{X) are polynomials of degree j in A. Now by ( 16. 4> and (16. 5> we have 
/ife = A/ifc_i, and in general 



(6.28) 

From this we see 
(6.29) 

Mo 
Amo 
AVo 

. fc-i , 



Pfc = A'Mfc-., , < i < fc; pfc = A''!. 



E'fe = det 



Ml M2 
Api Ap2 
AVi AVs 



A'^^Vo A'^-Vi A'''-V2 



1 



P 



Mfc 
Afik 



= det 



A' 



Mi 



:=o.i,2....,fc-i 

i=0,l,2,...,A: 



By the second relation in \b.21\ we can replace A' by 5^ and a linear combination of lower 
powers of d, which can be climated by row operations. Thus we can replace A by (5 in the 
above determinant without effecting its value, and then by the first relation in (I6.27> and 
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row operations we can replace 5' by A*i?' which on removing the factors of A in each row 
gives 



(6.30) 



k(k-l) 

Ek = X 2 det 



fc(fe-i) 
= dot 



/Uo Ml 
DfiQ D^ii Dfi2 
D^fio D^fH D^a 



D^-'^a D^-'^i D''-'n2 
1 p 



Pk 
Dpk 
D^Pk 

D'^-^Pk 



i=0,l,2,...,fc-l 

j=oa,2,...,fc 



We next need the relation 

(6.31) q{q-l)---{q~h + l) = hl 



3=0 



where 



h 
j 



are the Stirling numbers of the first type, although we do not need to use any 
properties of these numbers. Then we have by ( 16. 3> 
(6.32) 



A'' = e^X^e-^ 



°o \q+h 



q=0 



A« 



-A 



g=0 \j=0 

Thus, using column operations we see 



El. = A^^ det 



(6.33) 








" h ' 




_ j 






A 



A« 



h 
j 



A2 
D^X^ 

^fe-i^ ZJfe-iA D'^-iA^ 



1 



Dl DX 
D^l D^X 



k(k-l) 

X—^ det 



1 

D'X^ 
P 
3 



2! 



A'^ 
DA*= 
D^A'^ 



i=0,l,2,...,fc-l 
i=0,l,2,...,fc 



Expanding along the bottom row we see 



(6.34) det 



D^X' 
P 
3 



:=0,1.2,...,fc-l h=0 
J=0,1,2,...,A: 



V(-1)'=-''W ( ^ det[i?^A^"] ,=o,i....,fc-i ■ 



We will show below that 

(6.35) Eh = det[i?'A^] ,=o,i....,k-i = 1! 2! • • • (fc - 1)! 
which then gives on retracing our steps

k 

Dk-i = (-1)'^A^'^1!2! ••• {k-l)\^{-lf-''h\ 



\k — h 



(-l)'^A^^l! 2! • • • klLk^P-'^^X), 



which proves Lemma 4. 

We prove (6.35) by the following argument shown to us by Wasin So. We consider the complete upper triangular matrix

/I A A2 . 

Dl DX DX^ . 
DH D^X D^X^ . 



M =[D'X^U=o,i....,k 

j=0.1,...,k 



A'^ \ 
DX'' 
D^X'' 



D'^xy 



x^- 



i=0.1,...,fc 



Observe that det A/ = 1! 2! • • • A:!, and further that 
M = TP, where T = [S,,il ]^=o^,...,k, 

3=0,1, ...k 

Now by the matrix inverse formula using minors 



P 



x^- 



i=0,l....,fc 
j=0,l,...,k 



detM j=o,i,...,k 



where Fh occurs in this matrix as the minor Di, h ■ Further 



where 



T ^ — [Sij— ]i=o,l,...,k, 
j"=0,l,...,fc 

where we used the identity 

k 



= (-1)^4 



A^- 



j=0,l,...,fe 
j=0,l,...,k 



From this last relation we see, letting M ^ = [mij], 



Fh = (~l)''+HietM)mHk = (-1)'+' 



.(-1) 



k-h 



k\ 



-1!2! ••• fc! 



k~h 



as desired. 
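As a sanity check on Lemma 4, the determinant $D_{k-1}$ can be compared with the right-hand side of (6.23) in exact arithmetic. The sketch below is illustrative only; it assumes the reading $D_{k-1} = (-1)^k\lambda^{k(k-1)/2}\,1!\,2!\cdots k!\,L_k^{(\rho-k)}(\lambda)$ of (6.23), and the helper names are ours.

```python
from fractions import Fraction
from math import factorial

def stirling2(k, v):
    if k == v:
        return 1
    if v == 0 or v > k:
        return 0
    return stirling2(k - 1, v - 1) + v * stirling2(k - 1, v)

def mu(k, lam):
    return sum(stirling2(k, v) * lam ** v for v in range(k + 1))

def c(m, lam, rho):
    """c_m(lambda, rho) = mu_{m+1}(lambda) - rho * mu_m(lambda), see (6.10)."""
    return mu(m + 1, lam) - rho * mu(m, lam)

def det(matrix):
    """Exact determinant by Gaussian elimination over the rationals."""
    a = [row[:] for row in matrix]
    n, sign, result = len(a), 1, Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for cc in range(col, n):
                a[r][cc] -= factor * a[col][cc]
    return sign * result

def laguerre(n, alpha, x):
    """L_n^{(alpha)}(x) from (6.11), with a generalized binomial coefficient."""
    total = Fraction(0)
    for v in range(n + 1):
        binom = Fraction(1)
        for t in range(n - v):                 # C(n + alpha, n - v)
            binom *= (n + alpha - t) / Fraction(t + 1)
        total += binom * (-x) ** v / factorial(v)
    return total

def lemma4_sides(k, lam, rho):
    """Return (D_{k-1}, right-hand side of (6.23)) for rational lam, rho."""
    lam, rho = Fraction(lam), Fraction(rho)
    lhs = det([[c(i + j, lam, rho) for j in range(k)] for i in range(k)])
    prefactor = (-1) ** k * lam ** (k * (k - 1) // 2)
    for j in range(1, k + 1):
        prefactor *= factorial(j)
    return lhs, prefactor * laguerre(k, rho - k, lam)
```

For $k = 1$ both sides reduce to $\lambda - \rho$, which is a convenient first case to inspect by hand.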

Proof of Proposition 3. Let $\mathbf{a}$ be the solution for (6.15) found in Lemma 3, which exists for any $\lambda$ where $D_{k-1} \ne 0$, and let $\mathbf{b}$ be any other $k$-vector with $b_k = 1$. Then $P_{\mathbf{b}-\mathbf{a}}(x)$ is a polynomial of degree $k-1$ or less, and by the orthogonality property (6.15)

$Q_{\mathbf{b}}(\lambda,\rho) = \langle P_{\mathbf{a}}(x) + P_{\mathbf{b}-\mathbf{a}}(x),\ P_{\mathbf{a}}(x) + P_{\mathbf{b}-\mathbf{a}}(x)\rangle_{\rho}$
$\qquad = \langle P_{\mathbf{a}}(x), P_{\mathbf{a}}(x)\rangle_{\rho} + \langle P_{\mathbf{b}-\mathbf{a}}(x), P_{\mathbf{b}-\mathbf{a}}(x)\rangle_{\rho}$
$\qquad = Q_{\mathbf{a}}(\lambda,\rho) + Q_{\mathbf{b}-\mathbf{a}}(\lambda,\rho).$

In general by (6.7) and (6.9) for any $\mathbf{c} \ne 0$, assuming $\rho > 0$ is fixed,

$Q_{\mathbf{c}}(\lambda,\rho) = \sum_{j=0}^{\infty}(j-\rho)\big(P_{\mathbf{c}}(j)\big)^2 p(j) = -\rho\,\langle (P_{\mathbf{c}}(x))^2, 1\rangle + O_{\mathbf{c}}(\lambda) < 0, \quad \text{for } 0 < \lambda < \lambda_0(\mathbf{c},\rho),$

where $\lambda_0(\mathbf{c},\rho)$ is a small positive constant depending on $\mathbf{c}$. Thus

$Q_{\mathbf{b}}(\lambda,\rho) < Q_{\mathbf{a}}(\lambda,\rho)$

for $0 < \lambda < \lambda_0(\mathbf{c})$, proving that $Q_{\mathbf{a}}$ is maximal at least for small enough $\lambda$. This will continue to be true for larger $\lambda$ as long as $Q_{\mathbf{c}} < 0$ for any $(k-1)$-vector $\mathbf{c}$, and therefore as long as the maximal $Q$ for $(k-1)$-vectors is negative. By (5.1.14) of Szegő [19] we have

$\frac{d}{dx}L_n^{(\alpha)}(x) = -L_{n-1}^{(\alpha+1)}(x),$

and therefore we see the sequence $\{L_k^{(\rho-k)}\}$ of Laguerre polynomials has the property that the negative of the derivative of a term is the previous term. (Thus the negative derivative of the Laguerre polynomial in the numerator in (6.12) is the Laguerre polynomial in the denominator.) Further this sequence of Laguerre polynomials all are decreasing functions up to their first positive zero, and hence the sequence of smallest positive zeros $x_1(k, \rho-k)$ is a decreasing sequence. Starting with the trivial case when $k = 1$ we see successively that the $Q_k$ with $\mathbf{a}$ satisfying (6.15) will be maximal for $0 < \lambda < x_1(k+1, \rho-k-1)$. This completes the proof of Proposition 3.
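The derivative relation quoted from Szegő in the proof above can be confirmed directly from the explicit formula (6.11). The following short sketch (our own helper names, exact rational coefficients) differentiates $L_n^{(\alpha)}$ coefficient by coefficient and compares the negated result with $L_{n-1}^{(\alpha+1)}$.

```python
from fractions import Fraction
from math import factorial

def laguerre_coeffs(n, alpha):
    """Coefficients [c_0, ..., c_n] of L_n^{(alpha)}(x) taken from (6.11)."""
    alpha = Fraction(alpha)
    coeffs = []
    for v in range(n + 1):
        binom = Fraction(1)
        for t in range(n - v):                  # C(n + alpha, n - v)
            binom *= (n + alpha - t) / (t + 1)
        coeffs.append(binom * (-1) ** v / factorial(v))
    return coeffs

def derivative(coeffs):
    """Coefficients of the derivative of a polynomial."""
    return [v * coeff for v, coeff in enumerate(coeffs)][1:]

def derivative_relation_holds(n, alpha):
    """Check -d/dx L_n^{(alpha)} = L_{n-1}^{(alpha+1)} for n >= 1."""
    negated = [-coeff for coeff in derivative(laguerre_coeffs(n, alpha))]
    return negated == laguerre_coeffs(n - 1, Fraction(alpha) + 1)
```

The parameter `alpha` may be any rational number here; the proof only needs $\alpha = \rho - k$.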

Our next result evaluates the smallest positive zero $x_1(n,\alpha)$ asymptotically as $n \to \infty$.

Lemma 5. Let $L_n^{(\alpha)}(x)$, $\alpha > -1$, denote the Laguerre polynomials. The zeros of $L_n^{(\alpha)}(x)$ are real, positive, and simple. Let $x_1(n,\alpha)$ denote the smallest zero of $L_n^{(\alpha)}(x)$. If $\alpha = \beta(n) - n$ and $\lim_{n\to\infty}\frac{\beta(n)}{n} = A > 0$, then

(6.36) $\lim_{n\to\infty}\frac{x_1(n,\alpha)}{n} = (\sqrt{A} - 1)^2.$
Proof. The properties of $L_n^{(\alpha)}(x)$ may be found in Szegő [19]. Equation (6.36) is a special case of Theorem 4.4 of [3]. A simple proof may be obtained by using the same argument found in [16] where a result corresponding to (6.36) for Jacobi polynomials is proved using Sturm comparison theory. By (5.1.2) of Szegő, the differential equation

$u'' + \Big(\frac{n + (\alpha+1)/2}{x} + \frac{1-\alpha^2}{4x^2} - \frac14\Big)u = 0$

has $u = e^{-x/2}x^{(\alpha+1)/2}L_n^{(\alpha)}(x)$ as a solution. Let

$H_n(x) = \frac{n + (\alpha+1)/2}{x} + \frac{1-\alpha^2}{4x^2} - \frac14 = -\frac{x^2 - (4n + 2(\alpha+1))x + (\alpha^2 - 1)}{4x^2},$

and denote the smaller root of the quadratic in the numerator by $x_n^-$. Then by the Sturm comparison argument in [16], and noting $\lim_{n\to\infty}\frac{\alpha}{n} = A - 1$, we have

$\lim_{n\to\infty}\frac{x_1(n,\alpha)}{n} = \lim_{n\to\infty}\frac{x_n^-}{n}$
$\qquad = \lim_{n\to\infty}\frac{2n + \alpha + 1 - \sqrt{4n^2 + 2\alpha + 2 + 4n\alpha + 4n}}{n}$
$\qquad = 2 + \lim_{n\to\infty}\frac{\alpha}{n} - \sqrt{4 + 4\lim_{n\to\infty}\frac{\alpha}{n}}$
$\qquad = (\sqrt{A} - 1)^2.$
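The limit in Lemma 5 can be explored numerically. The sketch below is a rough floating-point illustration, not part of the proof: it evaluates $L_n^{(\alpha)}$ with the standard three-term recurrence, locates the smallest zero by scanning and bisection (the scan step `step` is an arbitrary choice of ours), and also returns the smaller root $x_n^-$ of the quadratic from the Sturm comparison argument.

```python
from math import sqrt

def laguerre_value(n, alpha, x):
    """Evaluate L_n^{(alpha)}(x) via (m+1)L_{m+1} = (2m+1+alpha-x)L_m - (m+alpha)L_{m-1}."""
    prev, cur = 0.0, 1.0
    for m in range(n):
        prev, cur = cur, ((2 * m + 1 + alpha - x) * cur - (m + alpha) * prev) / (m + 1)
    return cur

def smallest_zero(n, alpha, step=0.05):
    """Scan upward from 0 for the first sign change, then bisect."""
    lo = 0.0
    while laguerre_value(n, alpha, lo + step) > 0:
        lo += step
    hi = lo + step
    for _ in range(60):
        mid = (lo + hi) / 2
        if laguerre_value(n, alpha, mid) > 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def sturm_lower_root(n, alpha):
    """Smaller root of x^2 - (4n + 2(alpha + 1))x + (alpha^2 - 1)."""
    return 2 * n + alpha + 1 - sqrt(4 * n * n + 4 * n * alpha + 4 * n + 2 * alpha + 2)
```

Taking $\alpha = 3n$ corresponds to $A = 4$, for which the predicted limit of $x_1(n,\alpha)/n$ is $1$; the ratio approaches it from above, and only slowly, as $n$ grows.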

7. Gaps between primes.

In this section we prove Theorem 1. We want to examine statistically the number of primes in the interval $(n, n+h]$ for $N < n \le 2N$ with $N \to \infty$. In this range the average distance between consecutive primes is $\log N$, and thus we will take $h$ to be a multiple of this length. We therefore let

(7.1) $\psi(x) = \sum_{n\le x}\Lambda(n),$

(7.2) $\psi(n,h) = \psi(n+h) - \psi(n),$

(7.3) $h = \lambda\log N,$

and in this paper we assume that

(7.4) $\lambda \ll 1.$

The model for our method is due to Gallagher [5], who proved that if the Hardy-Littlewood conjecture (2.4) holds uniformly for $h \ll \log N$ then one can asymptotically evaluate all the moments for the number of primes in intervals of length $h$. Thus assuming (2.4), Gallagher proved that

(7.5) $M_k(N,h,\psi) := \frac{1}{N(\log N)^k}\sum_{n=N+1}^{2N}\big(\psi(n,h)\big)^k \sim \mu_k(\lambda),$

as $N \to \infty$, where $\mu_k(\lambda)$ is the Poisson moment from (6.3) and (6.4).
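To get a feel for the moments in (7.5), one can compute them directly for modest $N$. The sketch below is purely illustrative: it assumes the normalization $\frac{1}{N(\log N)^k}$ for (7.5), restricts $h$ to an integer, and uses our own function names. At such small $N$ only the first moment is expected to sit close to its limit $\mu_1(\lambda) = \lambda$.

```python
from math import log

def von_mangoldt_table(limit):
    """Lambda(n) for 0 <= n <= limit via a smallest-prime-factor sieve."""
    spf = list(range(limit + 1))
    for p in range(2, int(limit ** 0.5) + 1):
        if spf[p] == p:
            for m in range(p * p, limit + 1, p):
                if spf[m] == m:
                    spf[m] = p
    table = [0.0] * (limit + 1)
    for n in range(2, limit + 1):
        p, m = spf[n], n
        while m % p == 0:
            m //= p
        if m == 1:                      # n is a power of the single prime p
            table[n] = log(p)
    return table

def empirical_moment(k, N, h):
    """M_k(N, h, psi) as in (7.5), for an integer interval length h."""
    lam_table = von_mangoldt_table(2 * N + h)
    prefix = [0.0]
    for value in lam_table[1:]:
        prefix.append(prefix[-1] + value)      # prefix[x] = psi(x)
    total = sum((prefix[n + h] - prefix[n]) ** k for n in range(N + 1, 2 * N + 1))
    return total / (N * log(N) ** k)
```

For example, with $N = 10^4$ and $h = 9$ one has $\lambda = h/\log N \approx 0.98$, and `empirical_moment(1, N, h)` can be compared with that value.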

In order to obtain unconditional results we make use of our approximation $\Lambda_R(n;\mathcal{H})$. Taking $N < n \le 2N$, we first need to approximate

(7.6) $\psi(n,h)^k = \sum_{1\le h_1,h_2,\dots,h_k\le h}\Lambda(n+h_1)\Lambda(n+h_2)\cdots\Lambda(n+h_k) = (1+o(1))\sum_{1\le h_1,h_2,\dots,h_k\le h}(\log N)^{k-|\mathcal{H}|}\Lambda(n;\mathcal{H}).$

To define our approximation, we extend the definition of $\Lambda_R(n;\mathcal{H})$ in (2.3) to vectors (or lists) $\mathbf{H} = (h_1, h_2, \dots, h_k)$. The distinct components of the vector $\mathbf{H}$ are the elements of the set $\mathcal{H} = \{h_1, h_2, \dots, h_k\}$, and we define

(7.7) $\Lambda_R(n;\mathbf{H}) := (\log R)^{k-|\mathcal{H}|}\Lambda_R(n;\mathcal{H}).$
Then our approximation of $\psi(n,h)^k$ is

(7.8) $\psi_R^{(k)}(n,h) := \sum_{1\le h_1,h_2,\dots,h_k\le h}\Lambda_R(n;\mathbf{H}).$

For convenience we also define $\psi_R^{(0)}(n,h) = 1$. We next define the approximate moments, letting $k = i + j$,

(7.9) $M_{ij}(R) = \frac{1}{N(\log R)^k}\sum_{n=N+1}^{2N}\psi_R^{(i)}(n,h)\,\psi_R^{(j)}(n,h),$

and note that $M_{00}(R) = 1$. We also need the mixed moments

(7.10) $\tilde{M}_{ij}(R) = \frac{1}{N(\log R)^{k+1}}\sum_{n=N+1}^{2N}\psi(n,h)\,\psi_R^{(i)}(n,h)\,\psi_R^{(j)}(n,h),$

for which we note by the prime number theorem that $\tilde{M}_{00}(R) \sim \frac{h}{\log R} = \frac{\lambda}{\theta}$ in accord with (7.11) and (7.13) below.

Using Propositions 1 and 2 we will prove asymptotically that these approximate moments are also Poisson moments with an increased expected value involving the truncation level $R$. Define $\theta$ by

(7.11) $R = N^{\theta}.$

Proposition 4. As $N \to \infty$ we have, for $k = i + j \ge 1$ and for any fixed $0 < \theta < \frac{1}{k}$,

(7.12) $M_{ij}(R) = (1 + o_k(1))\,\mu_k\Big(\frac{\lambda}{\theta}\Big),$

and for any fixed $0 < \theta < \frac{1}{2k}$,

(7.13) $\tilde{M}_{ij}(R) = (1 + o_k(1))\,\mu_{k+1}\Big(\frac{\lambda}{\theta}\Big).$

Proof. By differencing, Propositions 1 and 2 continue to hold unchanged when we sum for $N < n \le 2N$. We first extend Proposition 1 for vectors $\mathbf{H}_1$ and $\mathbf{H}_2$. Recalling the notation $|\mathbf{H}|$ which denotes the number of components of the vector $\mathbf{H}$, let $k = |\mathbf{H}_1| + |\mathbf{H}_2|$, $\mathcal{H} = \mathcal{H}_1 \cup \mathcal{H}_2$, where $\mathcal{H}_i$ is the set of distinct components of $\mathbf{H}_i$. Then by (7.7) and Proposition 1 (note the $k$ in Propositions 1 and 2 is equal to $|\mathcal{H}_1| + |\mathcal{H}_2|$ here), we have for $R = o(N^{1/k})$,

(7.14) $\sum_{n=N+1}^{2N}\Lambda_R(n;\mathbf{H}_1)\Lambda_R(n;\mathbf{H}_2) = (\log R)^{k-|\mathcal{H}_1|-|\mathcal{H}_2|}\sum_{n=N+1}^{2N}\Lambda_R(n;\mathcal{H}_1)\Lambda_R(n;\mathcal{H}_2) = N\big(\mathfrak{S}(\mathcal{H}) + o(1)\big)(\log R)^{k-|\mathcal{H}|}.$

Thus we see this result depends on $k$ and not the individual values of $|\mathbf{H}_1|$ and $|\mathbf{H}_2|$. Hence, letting $h_1, h_2, \dots, h_k$ list the components of $\mathbf{H}_1$ and $\mathbf{H}_2$ (in any order), we have

$M_{ij}(R) = \frac{1}{N(\log R)^k}\sum_{1\le h_1,h_2,\dots,h_k\le h}\ \sum_{n=N+1}^{2N}\Lambda_R(n;\mathbf{H}_1)\Lambda_R(n;\mathbf{H}_2) = \sum_{1\le h_1,h_2,\dots,h_k\le h}\big(\mathfrak{S}(\mathcal{H}) + o_k(1)\big)(\log R)^{-|\mathcal{H}|},$

provided $R = o(N^{1/k})$. We group terms in this sum according to the number of distinct values $\nu$ of $h_1, h_2, \dots, h_k$, and denote these distinct values by $h'_1, h'_2, \dots, h'_{\nu}$. There are ${k \brace \nu}$ ways to partition the $k$ $h_i$'s into these $\nu$ disjoint sets, and all of these will occur in the sum above. Hence by (2.13) we have

$M_{ij}(R) = \sum_{\nu=1}^{k}{k \brace \nu}h^{\nu}(1 + o_k(1))(\log R)^{-\nu} = (1 + o_k(1))\,\mu_k\Big(\frac{\lambda}{\theta}\Big),$

which proves the first part of Proposition 4. The second part is proved identically using Proposition 2.
Now consider

(7.15) $S_k = S_k(N,R,\lambda,\rho) = \frac{1}{N(\log R)^{2k+1}}\sum_{n=N+1}^{2N}\big(\psi(n,h) - \rho\log N\big)\big(P_k(\psi_R(n,h))\big)^2,$

where

(7.16) $P_k(\psi_R(n,h)) = \sum_{\ell=0}^{k} a_{\ell}\,\psi_R^{(\ell)}(n,h)(\log R)^{k-\ell},$

and the $a_{\ell}$'s are arbitrary functions of $N$, $R$, $k$, $\lambda$, and $\rho$ which are to be chosen to optimize the argument. On multiplying out we have that
27V 

(7.17) X mn,h)- ^log R)^PR^'\n,h)i:R^^\n,h) 

n=N+l 

= E aiGjMij. 

0<i,j<k 

Letting 
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we have by Proposition 4 on taking i + j ^ k and assuming < 6 < 

(7.19) =/i,+i(A) -pAi«(A) + o,(^) 

= Ck(A,p) +o^(^^), 

using the notation of (6.10) in the last line. To evaluate $S_k$ we need to apply these results for $0 \le \kappa \le 2k$, all of which will hold if we impose the condition

(7.20) < ^ < TT- 

4fc + 1 - 4fc 

Thus 

(7.21) o<i,j<k 

= Qa(A,p)+Ofc(l), 

since maxi<£<fe ja^ |^ depend only on k for fixed A and p. By Proposition 3 we obtain 
a sign change for Qa{X,p) at the smallest zero a:;i(fc + 1, p — fc — 1) of the Laguerre 
polynomial Lfc+i^^"''""^^ (A), with (5a(A, p) negative for < A < xi{k + 1, p - k - 1) 
and positive for xi{k + l, p — k~l) < A < xi{k, p~ k). Therefore by ( I7.21> Sk will also 
be positive for + 1, p — fc — 1) + oa;(1) < A < xi{k, p — k) — Ok{l) as N ^ oo. We 
apply Lemma 5 with (3{k) — p; if we take sequences 6 — 6k and p — pk ^ as 

k — > oo, then A — 4r, and there exists constants < Cfc < c'k, and Ck,c'k — > 0, such that 
for iV^-^f + Ck<X<{^- \f + c'fc we have 

(7.22) Sk »fc 1, 5fe > 0. 

Note that the Laguerre polynomials are well defined by (7.20) here since by (7.20) $\rho - k > 0$. The proof of Theorem 1 is now a standard deduction from (7.22): we follow our earlier proof in the last section of [7]. Define

2N 

(7.23) Qt{N,h)= 1- 

ri=JV+l 
7r(n+/i) — 7r(n)>r 

If $n$ is an integer for which $\pi(n+h) - \pi(n) > r$ then there must be a $j$ such that $n < p_j$ and $p_{j+r} \le n + h$. Thus $p_{j+r} - p_j < h$ and $p_{j+r} - h \le n < p_j < p_{j+r}$, so that there are at most $h$ such $n$'s corresponding to each such gap. Therefore



(7.24) Q+{N,h)<$:rh 1 + 0(7Ve-^^^), 

N<p„<2N 

where we have used the prime number theorem to remove the prime gaps overlapping the endpoints $N$ and $2N$. (This can be done more explicitly as in [7].)
Next, we have, for $N$ sufficiently large,

2N 

(7.25) Qt{N,h)^ J2 1 + 0(7V^), 

n=N+l 
^{n+h) — ip{n)>plog N 
where $\rho$ can be taken to be any number in the range $r < \rho < r + 1$, and the error term is from removing prime powers. By (7.25) and Cauchy's inequality we see that



(7.26) 

Sk < 

< 



1 



N{logR) 



2k+l 



2JV 

E 

n=N+l 
ij{n+h)-tlj{n)>p log N 



N{\ogR) 



1 

2k+l 



2N 

E 



(Pa*B(n, /.)))* 



. n=N+l 
\^{n+h)-Tp{n)>p log N 



J 



2N 



\n=N+l 



< 



■''Q+{N,h) + 0{m/^) ( 



7V(logi?) 



^ ^{n,hf{Pk{^R{n,h))Y)^ 

/ 2N \ 

{Pk{^R{n,h)))' 



\n=N+l 

Hence, provided ( I7.22> holds we have 
(7.27) Q+{N,h) + 0{N^^^)-»k 



vn=JV+l 



(7V(l0gi?)2'=+l)' 



2N \ / 2N \ ' 

,=Ar+l / \n=N+l J 



We will prove below that subject to $h \ll \log N$ from (7.3) and (7.4) we have

2N 



(7.28) 

and 

(7.29) 



J2 iPin,h)^ <t:N(logNY 



n=N+l 



2N 



J2 {Pk{4^R{n, h))y « N{\ogNf\ 



n=N+l 



Therefore we conclude from M .2A\ - M .211 that for A = (-y/r ~ \Y + Ck 



(7.30) 



x-^ N 



N<p„<2N 



where $c_k \to 0^{+}$ as $k \to \infty$, which proves Theorem 1.

Before proceeding to the proofs of ( I7.28> and ( I7.29> . we note that, for N < n < 2N, 
the trivial estimates 'ip{n, h) ^ hlogN and il)ji-^\n, h) <Ca; immediately imply the 
bounds ^ N'^^'^ in (I7.28t and ( I7.29> from which (I1.10> follows. To prove ( I7.28> we make 
use of the sieve bound 



(7.31) 



J2 Hn-^Hk) < (2'-fc! + e)6{nk)N, 
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(see fm Theorem 5.7 or jo] Theorem 4 of 2.3.3). Then by equation (I7.6l i and ( 12.131 1 

2JV 2N 

i^{n;hf= (! + «(!)) E {\og Nf-\^\ K{n-n) 

n=N+l n=N+l l<hi,h2Ji3,h4,<h 

= (1 + 0(1))^ l(logiVr^ ^ ( ^ A{n;H.)) 

(7 32) distinct 

< (7V + o(7V))^|^|2^^!(log7V)'^-^( J2 

i^^l ^ l<hi,...Ji^<h 

distinct 

< (^E|^|z/!(2Ar + ej7Vlog'*iV, 
which proves (7.28).

The proof of (7.29) is based on a generalization of Proposition 1 proved in [8] by the same method used in the proof of Proposition 1 in this paper. For $k \ge 1$, and $\mathcal{H} = \{h_1, h_2, \dots, h_r\}$ with distinct integers $h_i$, and $\mathbf{a} = (a_1, a_2, \dots, a_r)$, $a_i \ge 1$ with $\sum_{i=1}^{r} a_i = k$, let

(7.33) $S_k(N,\mathcal{H},\mathbf{a}) = \sum_{n=1}^{N}\Lambda_R(n+h_1)^{a_1}\Lambda_R(n+h_2)^{a_2}\cdots\Lambda_R(n+h_r)^{a_r}.$

Then for $\max_i |h_i| \le R$ and $R \ge 2$ we have

(7.34) $S_k(N,\mathcal{H},\mathbf{a}) = \big(C_k(\mathbf{a})\mathfrak{S}(\mathcal{H}) + o_k(1)\big)N(\log R)^{k-r} + O(R^k),$

where the $C_k(\mathbf{a})$ are constants that are computable rational numbers. On multiplying out
the left-hand side of ( 17.291 1 we obtain a linear combination of (fc + 1)'' terms of the form 

2N 4 
n=N+l i=l 

for any < £1, £2, (3, d < k. Letting £ = + £3 + ^3 + £i, then < ^ < 4fc and we have 

2N 4 
l<-mi,m2,---,rni<h n— z— 1 

where mi, TO2, . . . , run through the components of the H^, 1 < i < 4. Letting H be 
the set of distinct components, we have by il.lt and il. 341 . 



2N 4 

Y n^«("'H0«fc^6(7^)(logi?)^-l«l, 

n=N+l i=l 






D. A. GOLDSTON AND C. Y. YILDIRIM 



and by J2.13> 

j^l l<hi,...Jij<h 
distinct 

<fciv(log7v)4^ 

which proves il.29i . 